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ABSTRACT: 

Let G be a finite graph or an infinite graph on which acts with finite fundamental domain. If 
G is finite, let T be a random spanning tree chosen uniformly from all spanning trees of G; if G is 
infinite, methods from [Pern] show that this still makes sense, producing a random essential spanning 
forest of G. A method for calculating local characteristics (i.e. finite-dimensional marginals) of T from 
the transfer-impedance matrix is presented. This differs from the classical matrix-tree theorem in that 
only small pieces of the matrix (n-dimensional minors) are needed to compute small (n-dimensional) 
marginals. Calculation of the matrix entries relics on the calculation of the Green's function for G, 
which is not a local calculation. However, it is shown how the calculation of the Green's function may 
be reduced to a finite computation in the case when G is an infinite graph admitting a Z d -action with 
finite quotient. The same computation also gives the entropy of the law of T. 

These results are applied to the problem of tiling certain lattices by dominos - the so-called dimer 
problem. Another application of these results is to prove modified versions of conjectures of Aldous [A12] 
on the limiting distribution of degrees of a vertex and on the local structure near a vertex of a uniform 
random spanning tree in a lattice whose dimension is going to infinity. Included is a generalization of 
moments to tree-valued random variables and criteria for these generalized moments to determine a 
distribution. 
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1 Introduction 



We discuss spanning trees and domino tilings (perfect matchings) of periodic lattices. To define these 
terms, let S = {1, ... , k} be a generic fc-element set and let G be a graph whose vertex set is S d x k, 
i.e. its vertices are all pairs (x, i) where x = (x\, . . . , Xd) is a vector of integers of length d and i is an 
integer between 1 and k. We will usually allow G to denote the vertex set 7L d x S, since this causes no 
ambiguity. We say that G is periodic if its edge set is invariant under the natural ^ d -action; in other 
words we require that (x,i) is connected to (y,j) (written (x,i) ~ (y,j)) if and only if (0,i) ~ (y — x,j). 
Assume throughout that G is connected and locally finite. By periodicity there is a maximal degree D of 
vertices of G. Graphs that we consider may have parallel edges, in other words more than one edge may 
connect the same two vertices. It is convenient to add self-edges - edges connecting a vertex to itself - 
until all vertices have degree D; such a graph is called D-regular. Adding self-edges does not alter any 
of the problems we address, so we assume throughout that all vertices of G have degree D. It will also 
be convenient to assume that simple random walk on G is aperiodic. Since this is true whenever G has 
a self-edge, we assume the presence of at least one self-edge. 

A spanning tree of any graph is subcollection of the edges having no loops, but such that every pair 
of vertices is connected within the subcollection. A loopless subgraph that is not necessarily connected 
is called a forest, and a forest in which every vertex is connected to infinitely many others is called an 
essential spanning forest. It is shown in [Pern] that the uniform measures on spanning trees of a cube of 
finite size n in the integer lattice "Z, d converge weakly as n — + oo to a measure /i^d on essential spanning 
forests of 7L d . This measure chooses a spanning tree with probability one if d < 4 and with probability 
zero if d > 5. The first purpose of the present work is to show how the finite dimensional marginals and 
the entropy of the limiting measure [i^d may be effectively computed. Since the computations may be 
carried out in the more general setting of periodic lattices, and since some of these (e.g. the hexagonal 
lattice in the plane) seem as interesting as 7L d from the point of view of physical modelling, we treat 
the problem in this generality The methods of [Pom] extend to the case of arbitrary periodic graphs 
to show that [Iq chooses a spanning tree with probability one if d < 4 and zero if d > 5. We will not 
re-prove this result in the more general setting, since that would involve a completely staight-forward 
but lengthy redevelopment of the theory of loop-erased random walks [La2] for periodic graphs. 

A domino tiling of a graph is a partition of the vertices into sets of size two, each set containing 
two adjacent vertices. Domino tilings on % 2 have been studied [Kas] and the exponential growth rates 
of the number of tilings of large regions with various boundary conditions have been calculated. The 
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growth rates for domino tilings are different for different boundary conditions [Kas, TF, Elk]. The second 
purpose of this work is to exhibit the domino tiling of maximal entropy for each periodic lattice in a 
special class (that includes K 2 ), and to compute the entropy. We exploit a general version of a known 
connection between domino tilings and spanning forests, so that this follows more or less immediately 
from the results on spanning forests. The correspondence also gives a way to calculate probabilities of 
various contours arising in a uniform random domino tiling. The first few of these are calculated by 
Fisher [Fil, Fi2] using Pfaffians. 

The method we use to calculate the f.d.m.'s of fie is to calculate the Green's function for G and then 
to write the f.d.m.'s as determinants of the transfer-impedance matrix, which is a matrix of differences 
of the Green's function. The main result on transfer impedance matrices is stated and proved in Section 
4 (Theorem 4.2) in the general setting of periodic lattices. Since the result is interesting in itself and is 
for the rest of the paper sine qua non, we state here a simplified version for finite graphs. 

Theorem 1.1 Let G be a finite D-regular graph. Fix an arbitrary orientation of the edges of G and for 
edges e = xy and f of G define H(e, f) to be the expected signed number of transits of f by a random 
walk started at x and stopped when it hits y (this can be written as a difference of Green's functions). 
For edges ei, . . . , e& let M(ei, . . . , eu) denote the matrix whose i,j-entry is H{a 1 ej). If T is a random 
spanning tree, uniformly distributed among all spanning trees of G, then 

P(ei,...,e fc e T) =detM(ei,...,e fc ). 

□ 

The idea of a transfer-impedance matrix is not new, the terminology being taken from [Wei]. We have 
not, however, been able to find the key result (Theorem 4.2) on determinants of the transfer-impedance 
matrix stated anywhere. Furthermore, extending results about transfer-impedances from finite graphs 
to infinite graphs is not immediate (at least when simple random walk on the infinite graph is transient) 
and requires an argument based on triviality of the Poisson boundary. For these reasons, we include a 
derivation of all results on transfer-impedances from scratch. 

The rest of the paper is organized as follows. The next section contains the notation used in the rest 
of the paper and a derivation of the Green's function for a periodic lattice. Section 3 contains lemmas, 
such as a discrete Harnack's principle, about simple random walks on periodic lattices. Rather than 
providing detailed proofs, we include in an appendix the outline of a standard proof for the case G = TL d 
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and indicate the necessary modifications for arbitrary periodic lattices. The connection between simple 
random walks and spanning trees is documented in [Pern] ; the main result that will be used from there 
is that P(e G T) is determined by certain hitting probabilities, but the reader desiring more details 
may also consult [A12] or [Bro]. Section 4 uses these lemmas to show that the Green's function is the 
unique limit of Green's functions on finite subgraphs and that it in fact determines the f.d.m.'s for /iq 
via determinants of the transfer-impedance matrix. Section 5 considers two examples. The first is the 
case G = H? , which is special because the Green's function for general periodic lattices is given by a 
definite integral which is only explicitly cvaluable when G — H? . The second is the high dimensional 
limit of G — fflj d ,d — ► oo, which converges in a sense to be defined later to a critical Galton- Watson 
Poisson(l) branching process, in accordance with a conjecture of Aldous [All]. Section 6 calculates the 
entropy of fia- Section 7 discusses the connection between spanning trees of a lattice and domino tilings 
of the join of a lattice and its dual. From this follows a determination of the topological entropy for 
domino tilings of graphs that are joins of a periodic lattice and its dual. It is also possible from this to 
exhibit f.d.m.'s of the maximal entropy domino tiling in a few special cases. 

Certain characterizations of the Green's function and Harnack principles are required that are essen- 
tially adaptations of known results on "Z d to arbitrary periodic lattices. These adaptations are treated 
briefly in the appendix. Also given in the appendix are criteria for determining limits of probability 
distributions on trees from knowledge of certain functionals which act as generalized moments. 

2 Notation and a Green's Function 

Let G be a periodic lattice with the assumptions of connectedness, -D-regularity and there being at least 
one loop, as in the previous section. Let \ig denote the weak limit as n — ► oo of the uniform measures 
on spanning trees of the induced subgraph on G with vertices {(x,i) :\\ x ||oo < n}. The arguments 
in [Pern] show that this limit exists because the probability of the elementary event of a finite set of 
edges all being in the tree is always decreasing in n and because the measures of these elementary events 
determine jiQ. It follows from Corollary 3.4, the random walk construction of spanning trees and the 
first equality of Lemma 4.3 that, as in the case where G — the limit can be taken independent of 
the boundary conditions, i.e. the limit for induced subgraphs is the same as for tori. For an edge e, we 
often write P(e g T) for ^i{T : e g T}. 

For any finite set of edges e\, . . . , e/t that form no loop among them there is a graph G/ei, 
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called the contraction of G by ei, . . . , efe. Its vertices are the vertices of G modulo the equivalence 
relation of being connected by edges in {ei, . . . , eu}- Let the projection from vertices in G to vertices 
of G/e\, . . . , efc be called n. Then the edges of G/ei, . . . , are precisely one edge connecting n(x) to 
ir(y) for each edge connecting x to y in G. If W is a subset of the vertices of G, the induced subgraph 
of G on W is the graph with vertex set W and an edge xy for each edge xy € G with x, y e IT. For 
use in Section 7 wc include the dual notion to contraction, namely deletion. If e 1; . . . , are edges of 
connected graph G whose removal does not disconnect G then the deletion of G by ei, . . . , e^, denoted 
G — ei, . . . , efe is simply G with ei, . . . removed. The salient point is that deletion and contraction by 
different edges commute, so that G/ei, . . . , efe — e[, . . . , e\ is well-defined when 7^ for all i, j. 

Let S-RW^ denote simple random walk on G starting from x. More precisely, a path in G is a 
function / from the nonncgative integers to the vertices of G such that f(i + 1) is always adjacent to 
f(i); SRW^ makes all possible intial segments of a given length equally likely. Write P(SRW^(i) = y) 
for the probability that a simple random walk on G started from x is at y at time i. Either G or x may 
be suppressed in the notation when no ambiguity arises. For a subset B of the vertices of G, let dB 
denote the boundary of B, namely those vertices x € B that have neighbors in B c . For any x ^ B, let 
rjj? = inf{j : SRW x (j) G B} denote the (possibly infinite) hitting time of B from x. 

Define the vector spaces Vs, Vr^d and Vg to be the set of complex- valued functions on S, TL d and % d x 
S respectively, with pointwise addition, scalar multiplication, and the topology of pointwise convergence. 
For lie Vs and g G V^d let u <£> g denote the / € Vq for which f(x,i) = g(x)u{i). Let T d denote the 
rf-dimensional torus H d /'E d , written as <i-tuples of elements of (-1/2,1/2]. For a e T d and x € % d , 
the inner product a ■ x = X^»=i a i x i i s a 

well-defined element of HfE; for fixed a e T d , let £ Q e 
VV^d be defined by £ a (x) = e 27 ™"' 1 . Define the adjacency operator A : Vq — > Vg by (j4/)(x,z) = 
(y,j)~(x,i) fiVij)- Here, and in all subsequent such summations, the element (y, j) is to be counted 
as many times as there are edges from (x, i) to (y,j). A function / G Vg is called harmonic if Af = f and 
harmonic at (x, i) if (Af)(x, i) = f(x, i). We define a family of k by fc adjacency matrices {R x : x e S d } 
by letting R x (i,j) equal one if (0, i) <~ (x, j) and zero otherwise. Observe that R x = for all but finitely 
many x £ TL d and that = (R X ) T . For a <G T d , define the fc by k matrix Q(a) by the essentially 
finite sum D^ 1 T, xe ^ e 2ma ' x R x - 

Now we begin building Green's functions for simple random walk on G. For a transient walk, the 
Green's function H(x,y) can be defined as the expected number of visits to y starting from x. This is 
symmetric, and harmonic in each argument except on the diagonal. Here, we construct a function gf 
that is harmonic except at a finite number of points x, at which (7 — A)gf(x) is equal to some specified 



5 



/. Later, a uniqueness theorem will show that when f — 5 X for some x <G G, then g> specializes to 
H(x, •). In dimension two, simple random walk is recurrent. In this case, although the Green's function 
may still be defined classically by subtracting the expected number of visits from x to itself, the integral 
defining gf will blow up for / = 5 X . It will, however, be finite for / = S x — 5 V . 

When G is just ffi 1 with the usual nearest-neighbor edges, the eigenfunctions in Vg for the adjacency 
operator are just the functions The first lemma uses these to construct eigenfunctions for A. 

Lemma 2.1 Suppose u e Vs satisfies Q(a)(u) — Xu for some real A. Then A(u ® £") = \u <E> 



Proof: 



= ^ E i? z (^J>,e 27rM ' (a:+z) 
zeTL d .\<3<k 

= <? m - x E (3 E e 2 ™-*R*)(i,j) Uj 

= e 2 — 2 Q(a)(i,j) Uj 
i<j<fe 

= A(u®£ a )0M) D 

This lemma tells us how to invert (/ — A) on elements of Vg of the form u ® £ Q where u is an 
eigenvector of Q(a) with eigenvalue not equal to one. By representing general elements of Vg as integrals 
of eigenfunctions we can then invert (I— A) on these integrals since (I— A)^ 1 commutes with the integral, 
at least when absolute integrability conditions are satisfied. A preliminary observation is that for any 
ueV s , 

u ® £ Q da — w <g) S , 



I 



G 



where da is the usual Haar measure on T d . To see this, note that the integrand is bounded in magnitude 
by so the integral makes sense and integrating pointwise gives 



J u®i a {x,i)da = Ui J e Zm - x da^u t 8 G {x). 

Inverting (I — A) on elements of Vg with finitely many nonzero coordinates is easily reduced to inverting 
(I — A) on things of the form u ® So and their translates by elements of The above representation 
shows that these are integrals of u (B> which are sums of eigenfunctions (the cigcnfunctions given 
by letting u be an eigenvector of Q(a)). So the above representation solves the problem as long as the 
inverted integrand, (I — \)~ 1 u(&t; a remains integrable. With this in mind, observe that Q(a) is Hcrmitian 
for each a. It is therefore diagonalizablc with real eigenvalues and has a unitary basis of eigenvectors. 
Let {v(a, i) : a e T d , 1 < i < k} denote a measurable selection of ordered eigenbasis for Q(a) and let 
X(a,i) be the eigenvalue corresponding to v(a,i). Now for u e Vs, let c u (a,i) =<C u,v(a,i) 3> denote 
the coefficients of u in the chosen eigenbasis, in other words 

c u (a, i)v(a, i) = u 

i 

for each a e T d . Then we have the following theorem. 

Theorem 2.2 ForueVs, let 

g u = j^Re{c u {a,i){\- A(a, i) ® da (1) 

i=i 

and for j < d, define 

g u ' j = [j2 Re { cU ( a > l )( 1 -Ka,i)y\l-e~ 27Tl °' 3 )v(a,i)®C}da. (2) 
J i=i 

The integrals are meant pointwise, i.e. as defining g u (x) and g u ' : '(x) for each x e G. Then 

(i) The integrand in (2) is always integrable and the integrand in (1) is integrable when d>3 
or when d = 1 or 2 and J^i u i = 0- 

(ii) (I — A)g u — u <g> So and (I — Ajg 11 ^ = u ® (So — 5 ej ) whenever the integrals exist, ej being 
the j th standard basis vector in 

(Hi) g u and g 11 ^ are bounded whenever the defining integrals exist. 
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Remark: When d > 2, it is not necessary to take the real part in (1) and (2) since the imaginary part 
integrates to zero. When d = 1 however, the imaginary part fails to be integrable. 

For /, g G Vq, say / is a translate of g if f(x, i) = g(x + a, i) for some a G % d . If / G Vq has finitely 
many nonzero coordinates, then it can be represented as a finite sum of translates of elements of the 
form u® 5q. Further, if the sum of f(x,i) is zero, then / can be represented as the sum of translates 
of elements u <g> So for which J2 u j — together with translates of elements u <g> (do — S ej ) . Thus the 
following is an immediate corollary. 

Corollary 2.3 Let f G Vq have finitely many nonzero coordinates. If d > 3 or d = 1 or 2 and 
YlxeG f( x ) = 0; then the previous two theorems can be used to construct a bounded solution g to (I — 
A)g = f. □ 

The proof of Theorem 2.2 depends on the following lemma which bounds the eigenvalues of Q(a) 
away from 1 in terms of \a\ in order to get the necessary integrability results. 

Lemma 2.4 There is a constant K = K{G) for which max^ |A(a, i)\ < 1 — K \a\ 2 , where for specificity 
we take \a\ = max^ |aj|. 

Proof: The eigenvalues of a matrix are continuous functions of its entries [Kat], so it suffices to show 
that this is true in a neighborhood of and to show that X(a, i) ^ 1 for a^O. For the first of these, it 
suffices to find for each s < d a constant K s for which the eigenvalues of Q(a) are bounded in magnitude 
by 1 — if;5|a: s | 2 . So fix an s < d. 

Begin with a description of the entries of Q(a) r . The quantity Q{ct){i,j) is the sum over edges of G 
connecting (0,i) to (x, j) for x G of complex numbers of modulus 1/D. Furthermore, as i or j varies 
with the other fixed, there are precisely D of these paths. It follows that Q(a) r (i,j) is the sum over 
paths of length r connecting (0,z) to some (x, j) of complex numbers of modulus D~ r and that there 
are D r of these contributions in every row and every column. 

Suppose we can find an r = r(s) such that for every i < k there is a path of length r from (0,z) 
to (e s ,i) where e s is the s th standard basis vector. Since there is a self-edge at every vertex, there 
is perforce a path of length r from (0,z) to (0,i). These two paths represent summands in the above 
decompositon of Q(a) r (i, i) whose arguments differ by a s . By the law of cosines, the sum of these two 
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terms has magnitude D~ r (2 + 2cos(a s )) 1 / 2 < 2D~ r (l — ca 2 s ) for any c < 1/8 and a in an appropriate 
neighborhood of zero. Adding in the rest of the terms in the i th row of Q(a) r and using the triangle 
inequality shows that the sum of the magnitudes of the entries in the i th row is most 1 — 2cD~ r a 2 s for a s in 
a neighborhood of zero. The usual Perron-Frobenius argument then shows that no eigenvalue has greater 
modulus than the maximal row sum of moduli. This implies that in an appropriate neighborhood of zero, 
no eigenvalue of Q(a) has modulus greater than (1 — 2cD~ r a s ) 1 / r < 1 — K s a 2 where K s = 2cD~ r ^ /r(s), 
which is the bound we wanted. 

Finding such an r is easy. Since G is connected there is for each for each i < k a path of some length k 
from (0, i) to (e s , i). These can be extended to paths of any greater length by including some self-edges, 
so r can be chosen as the maximum over i of Zj. All that remains is to show that the only time Q(a) has 
an eigenvalue of 1 is when a = 0. This is essentially the same argument. Picking r' > max s r(s), the 
row sums of the moduli of of the entries of Q(a) r are strictly less than one and the Perron-Frobenius 
argument shows that no eigenvalue has modulus one or greater. □ 

Proof of Theorem 2.2: First we establish when the integrands in (1) and (2) are integrable. For each 
a the vectors v(a,i) form a unitary basis, hence the coefficients c u {a,i) are bounded in magnitude 
by Since v(a,i), £ a and 1 — e~ 27rQj all have unit modulus, integrability of |(1 — A(a,i)) _1 | is 
certainly sufficient to imply integrability of (1) and (2). For d > 3, this now follows immediately from 
|(1 - X(a,i))~ 1 \ < K\a~ 2 \. 

For d = 2, |a|~ 2 is not integrable, so we will find another factor in the integrands that is 0(|a|). By 
apcriodicity of SRW on G, we also have that the projected SRW on S is aperiodic, and therefore that the 
eigenvalue of 1 when a = is simple with eigenvector v(0, 1) = D -1 / 2 (l, . . . , 1). (Here we assume without 
loss of generality that the eigenvectors have been numbered so that for a in a neighborhood of zero, 
v(a, 1) is an eigenvector whose eigenvalue has maximum modulus.) The assumption J2i u % = m (1) then 
implies that c"(0, 1) = 0. By analyticity of the eigenbasis with respect to the entries of the matrix (at 
least away from multiple eigenvalues) [Kat], c u (a, 1) = 0(|a|). Thus \c u (a,i)(l — A(a, i)) _1 | < i^'lal -1 
which implies integrabilty of (1) when d = 2. Similarly, (1 — e 27Tlaj ) = 0(\a\), which implies integrability 
of (2) when d = 2. 

When d = 1, we will show that the real parts of 

[c>,lMa,l)®r]0M) (3) 

for ^2 i u i = and 

[c s ( Q) l)»(a ) l)(l-e- 2rta )0f](i ) i) (4) 



9 



for any u are both 0(|a| 2 ) as a — > for fixed (x,i) G G. Clearly, this is enough to imply integrability 
of (1) and (2). Observe that (3) and (4) are both zero when a = 0. By analyticity of v(a, 1) at zero, it 
suffices to show that derivatives of (3) and (4) with respect to a at zero are purely imaginary. Taking (4) 
first, we have 

^ (c>, l)v(a, 1)(1 - e- 2 ™*) ® T^, *)) |a=0 
= 2mc u (Q, l)w(0, 1) <g> i) = 2m < u, u(0, 1) > u(0, 1), G V^IlR, 
since the factor of 1 — e -2 ™ kills all the other terms in the derivative when a = 0. For (3), we get 

±(c"(a,l)v(a,l)®t a (x,i))\ a=0 

= K0,l)®£°]M^(a,l)U =o 

= u(0, 1), <C u, (d/da)| Q= oi;(a, 1) > 

so it suffices to show that v(a, 1) has imaginary derivative at zero. 

Observe first that Q{a) has imaginary derivative at a — since the entries of Q(a) are all sums of 
e 2Tnax £ Qr var j ous x ZlTL. Call this imaginary derivative R. Secondly, observe that the derivative of 
X(a, 1) vanishes at a = since X(a, 1) is real and attains its maximum at a = 0. Then letting w denote 
the derivative of v(a, 1) at a = 0, 

[Q + eR + 0(e 2 )}(v + ew + 0(e 2 )) = (1 + 0(e 2 ))(v + ew + 0(e 2 )) 

from which it follows that 

Rv= (I - Q)w + 0(e). 

Letting e — ■> gives Rv = (I — Q)w. Since R is imaginary and I, Q and v are real, it follows that w is 
imaginary. This shows that the real part of (3) is 0(|a| 2 ) and completes the proof of (i). 

The above argument actually also establishes boundedness of g u and g u 'i when d>2, but we give a 
different probabilistic argument since it is necessary to do so anyway for the case d = 1. Let s G (0, 1) 
be a real parameter and consider the functions and <?" J gotten by replacing X(a,i) by s\(a,i) in (1) 
and (2). The integrands are a fortiori absolutely integrable, being bounded in magnitude by |1 — s| _1 
times a possible factor of 2 for the 1 — e~ 27rtaj . Thus in fact g™ G l°°(G). Now taking the real part is no 
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longer necessary, since the imaginary part is an odd function of a and must integrate to zero. We have 
then 

k 



g u s = [^r^2s n \{a,i) n c u (a,i)v(a 1 i)®Z a da (5) 

•> i=l n=0 
oo k 

n=0 ^ i=l 

oo 

= ^s" / ^c"(a,i)A"(t;(cM)®£ a ) 

n=0 ^ i=l 
oo / „ k \ 

= ^ s"A™ / ^ c u (a, »)(v(a, i) <8> f) da 

n=0 V"^ i=l / 

oo 

= jS n A n (u®Jo). 

The reason A may be commuted with the sum and integral is that Af(x) is a finite linear combination 
of terms f(y) for y e G, and each of these terms is integrable. In a similar manner, we get 

oo 

tf J ' = X>M n (u®((Jo-<f ei ))- (6) 

n=0 

Since A n gives the transition probabilities for an n-step simple random walk on G, this means that 



n=0 

where X n is a SRW^ killed with probability 1 — s at each step. Similarly, 

oo 

S^(x) = ]TE U ®(<5o-<5 ej )PQ. 

n=0 

It follows from this that g™, <?" J — > as x — > oo for fixed s, u, j. From the forward equation for the random 
walk (or by direct calculation from (5) and (6)), g" = sAg™ + u®8q and y"'- 7 = sAg^'i + u ® (<5 — <5 ej ), 
whence it follows that (resp. g"'- 7 ) cannot have a maximum or minimum except on the support of 
u ® #o (resp. u (g) (<5o — <5 e ., ))- Since u® So (resp. u ® (<5o — S ej j) has finite support, say W C G, this 
implies that for all y e G, 

min ff » (a:) < y s u (y) < max 5 « (x) , (7) 
xgVv sew/ 
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and similarly for g"'- 7 . The proof of intcgrability of (1) and (2) shows that — > and g™'- 5 — ► j* J as 
s — > 1. Taking the limit of (7) gives 

min <?"(x) < ^(y) < max<?"(x) 
and similarly for g"'^, establishing (iii). 

Finally to show (ii), we have from (6) that 

(I- sA)g™ =u®6 . 

Since — > <? u pointwise as s — > 1 and since is a finite sum of values g™(y), the limit of the LHS 

as s — > 1 exists and is equal to (I — A)g u . A similar argument for g u ^ completes the proof of (ii) and 
of the theorem. □ 



3 Simple Random Walk on G 



Define the cube B n of size n in G to be those vertices (x, i) for which \xj\ < n : 1 < j < d. For x £ B n , 
define the hitting distribution on the boundary, u Bn , to be the law of SRW x (r Bn ). Thus for example, 
if x £ dB n , then v Bn = S x . For x ^ B n , define the hitting distribution on the boundary to be the same, 
but conditioned on the SRW hitting the boundary; thus z/f"(C) = P(SRW x (t x 3 ") £ C)/P(rf" < oo). 
For n < m and x £ d(B^), let p B " Bm be the hitting distribution on 9_B„ of a SiiW^ conditioned 
never to return to B^; for m < n and x € dB m , let p B " Bm be the hitting distribution on <9-B n of 
S-RWa; conditioned not to return to B m . The following lemma is an adaptation of the discrete Harnack 
inequalities on % d for general periodic lattices. The proof is merely an adaptation of the proof for % d 
and will be sketched in the appendix. 



Lemma 3.1 (Harnack principles) Let G be a periodic graph satisfying the assumptions of the first 
section. Fix a positive integer n. Then as m — > oo 7 

(i) max v B ™({z})/v B ~({z})^l 

x,y£B n ,zedB m 

(ii) max p B ^({z})/p B ^({z})^l 

x.yEdBn.zEoBm 
zedB n ,x,yeB^ 
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(W) P » H max - 1 

(v) (i) — (iv) hold when G is replaced by a finite contraction of G 

□ 



Corollary 3.2 Bounded harmonic functions on finite contractions of periodic graphs are constant. 

Proof: For any vertex x, Let Xo,X\, ... be a simple random walk starting from x. If g is harmonic 
then {g(Xi)} is a martingale, and if g is bounded and x G B m then optional stopping gives g{x) = 
Eg(X(T^ m )) = J g{z)dv^ m (z). By (i) of the previous lemma, v^ m (z) = (1 + Q{l))vy m {z) as m — > oo, 
hence g(x) = g(y) and g is constant. □ 

Corollary 3.3 Let G be a finite contraction of a periodic graph. Then for f G Vq there is, up to an 
additive constant at most one bounded solution g to (I — A)g = f . 

Proof: If gi and gi are two solutions then g\ — g 2 is a bounded harmonic function. □ 

Corollary 3.4 Let G be a finite contraction of a periodic graph and let G n be the induced subgraph on 
B n . For x,y,z G G with x ~ y define h(x,y, z,n) = P{SRW^ n (r^ — 1) = x) to be the probability that 
SRW^ n first hits y by coming from x (with h(x, y, y, n) =0). Then linin^oo h(x, y, z, n) exists for all 
x,y,z. 

Proof: Fix L such that x,y G Bl- For w G Bl define 

Mx,V,w) = P(r y w < oo and SRWM - 1) = x) 

and 

(f>2{x,y,w) = P(r£ < oo and SRW w {tI - 1) ± x). 

For i = 1,2, let <fii(x,y,w,n) be <fii(x,y,w) with the clause r*^ < oo replaced by < t w " and observe 
that <fri(x, y, w, n) — > <fii(x, y, w) asm oo. From the Harnack lemma we know that p^ LBm approaches 
a limiting measure p on 8Bl as m — > oo. We claim that 

h(x, y, z, n) -> P(rf < oo and SW z (t« - 1) = x) (8) 
+ P(r» = oo) / Mw)/(Mw) + Mw))dp(w). 
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To see this, write h{x, y, z, n) as P(SW Z G (r» - 1) = x and r| < rf ") + P(t|' > rf ")P(SWf " (r| - 
1) = x | r| > rf n ). The first of these terms is clearly converging to the first term in (8), while the 
first factor of the second is converging to P(r| = oo); the second factor is a mixture over u € dB n of 
P(SRW^ n (T^ — 1) = x), so it suffices to show that this is converging to the integral in (8) uniformly 
in u and n — > oo. Consider the sequence of times Ti, o\, t%, 02, . . . where T\ is the first time that SRW U 
hits Bi Jl a\ is the nest time it hits dB n , t 2 is the next time it hits B^, and so forth. The first hitting 
time t% of y must satisfy n < t% < Ui for some i. Now write 

P(SRW^(ry-l)=x) = 

E,~i P(SRW^ ( T y -i) = x: n <ry< 

ESi P(^W„ G " (t£ - 1) ? x; n < rl < at) + P{SRWS n {t% - I) = x;n < < a t ) 
The sum in the denominator is of course 1 , but the point of writing it this way is to illustrate that for 
each i the ratio is approximately the integral in (8). More precisely, for fixed i the Markov property 
gives that P(SRW^ n (ry — 1) = x; < t% < <Ji) is equal to P(r^ > <7i_i) times a mixture over 
v € dB n (corresponding to the last exit from dB n before n) of J <fii(x,y, w,n) dp BhBn . Similarly, 
P(SRW^ n (T^ — 1) 7^ x;Ti < t-{ < Oi) is equal to P(r^ > <7i_i) times a mixture over w <G dB n of 
J 4>2{x,y,w,n) dp BhBn . Since 0j(ar, y, u>, n) — * ^(x, y, w) and p BhBn = (1 + o(l))p as n — > 00, this shows 
that the ratio of the numerator to the denominator in the sum is (1 + o(l)) times the integral in (8) and 
proves the corollary. □ 



4 Transfer Impedance 



Before going into the definition of transfer impedance, it is worth pausing to remark that the functions 
g u constructed in Theorem 2.2 really are versions of the Green's function. This is not essential to any 
of the arguments below, so the proofs are relegated to the appendix. Define the usual Green's function 
H(-, •) on pairs of vertices of a periodic graph G by 

00 

H(x,y) = Y / P(SRW x (n) = y) 

when d > 3, and 

00 

H(x,y) = Yl\P(SRW x (n) = y) - P(SRW x (n) = x)} 
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when d = 1 or 2. It is easy to see that the sums are finite and that H(x,y) is harmonic in y except at 
y = x; it will also be shown that H is symmetric. [Later, we will use the above definition of H for finite 
graphs as well; see the appendix.] Now let Vo C Vg be the subspace of all functions with finite support. 
Think of the vertices of G as embedded in Vo by x ^ 5 X and the oriented edges xy as embedded n Vo 
by xy i ^ $x ~ 5y . Now extend H to a bilinear map on Vo x Vo . Similarly, think of the functions g u from 
Theorem 2.2, or in general the solution gf to (I — A)g = f from Corollary 2.3 as defining a bilinear form 
g on Vo x Vo (or when d < 2, on part of Vo x Vo) by letting g(f, S x ) — gf(x) and extending linearly. We 
then have 

Theorem 4.1 g = H whenever g is defined. Consequently, g is symmetric. □ 

Define the transfer impedance of two oriented edges e and / to be g(e, /). For any finite set e±, . . . , e k 
of edges, define their transfer impedance matrix M — M(ei, . . . ,e k ) to be the k by k matrix with 
M(i,j) = D~ 1 g(e i ,ej), where D is the degree of the graph G. Observe that the determinant of the 
transfer impedance matrix is independent of the orientation of the edges, since changing the orientation 
of ei has the effect of multiplying both the i th row and the i th column of M by — 1 . 

Theorem 4.2 Let G be any periodic graph satisfying the assumptions of the first section. For e 1; . . . , 
edges of G, pick an orientation for each edge and let M denote their transfer impedance matrix, so 
DM(i, j) = g{ei,Cj) = g Sx ~ Sy (z) — g Sx ~ 6 v (w), where ej = xy and ej — zw. (Here g may be defined by 
Corollary 2.3 or by extending H linearly, if H is already known.) If 'T is a uniform essential spanning 
forest for G, then 

P(ei,...,e fc eT) = det(M). 

Here is an outline of why Theorem 4.2 is true. For an oriented edge e = xy, the function (1/D)g(e, •) 
gives the voltages at vertices of G when each edge is a one-ohm resistor and one amp of current is run 
from x to y. A straight-forward application of Cramer's rule then produces a function h(z) that is a 
linear combination of g{e%, •) for % = 1, . . . , k and which computes the voltage for a unit current across 
efe in the graph TL d j e\, . . . ,ek-i- By the equivalences between electrical networks, random walks and 
uniform spanning trees [Pern], h(x) — h(y) computes the probability P(efe € T | e\, . . . , eu-i € T). The 
conditional probability turns out to be det(M(ei, . . . , e^))/ det(M(ei, . . . , e^-i)) and multiplying these 
together gives P(ei, . . . , e k g T) = det(M). 
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Lemma 4.3 Let G be a periodic graph and for edges ei, . . . ,efe forming no loop let G' = G/ei, . . . , e k -i- 
Let e k = xy in G . Let <f> be a bounded function on the vertices of G' harmonic everywhere except at x 
and y, with excess 1/D at x and — 1/D at y. LfT is the uniform random essential spanning forest of G 
and T" is the uniform random essential spanning forest of G' , then 

P(e fe G T') = P(e k e T | ei, . . . e k ^ e T) = <f>(x) - <j>(y). 

Proof: The first equality is standard [Pern]. For the other one, recall from [Pern] that P(e k € T') is de- 
fined as the limit of P(e^ e T' n ) where T' n is the uniform random spanning tree on G' n = G n /e\, . . . , e k -\- 

Ql 

This probability is just the probability that SRW X " first hits y by moving from x. Now Corollary 3.4 
shows that the probability h(x,y, -,n) of SRW. " first hitting y from x converges as n — > oo to some 
function h(x, y, z). Since h(x, y, z, n) is harmonic in z except at x and y, so is the limit. (In this notation, 
the probability we are after is h(x,y,x).) Also, it is easy to see that the excess of h(x,y, -,n) at x is 
1/D (use the forward equation). Thus the excess of h(x,y, -,n) at y must be —1 and the limit satisfies 
(/ — A)h(x, x, •) = (5 X — 6 y )/D. By Corollary 3.3, there is only one such function up to an additive 
constant, so <f> must equal h(x, y, •), thus 4>{x) — <p(y) = h(x, y, x) — h(x, y, y) = h(x, y,x). □ 

Proof of Theorem 4.2: Proceed by induction on k. When k = 1, detM = M(l, 1) = [g s *~ s y (x) — 
g s *~ s y (y)]/D, where e\ = xy. Denote this by (1/D)g ei . According to Theorem 2.3, (1/D)g ei is bounded 
and solves (I— A)g = {5 X — 5 y ]/D, so by the previous lemma, M(l, 1) calculates the probability of e\ e T. 

Now assume for induction that the theorem is true for k — 1. The easy case to dispose of is when 

def 

M = M(ei, . . . , ej;_i) has zero determinant. Then by induction P(ei, . . . , e k -i G T) = and it follows 
(e.g. from the random walk construction of T in [A12, Bro, Pem]) that the edges e\, . . . , e k -i form some 
loop. Suppose without loss of generality that the loop is given by e\, . . . , e r and that all edges are oriented 
forward along the loop. Then Y^i=i 9 &i — 0> hence M is singular and det(M) = P(ei, . . . , e k € T) = 0. 

In the case where det(M') ^ 0, the inductive hypothesis says that P(ei, . . . , e k -\ € T) = det(M') 
and it therefore suffices to show 

P(e k e T|ei,...,e fe _i E T) = det(M)/ det(M'). (9) 

To show (9) we construct the function <f> of Lemma 4.3. Write xtyi for a. Electrically, what we will 
be doing is starting with the function (1/D)g ek , which is the voltage function for one unit of current 
put in at x k and taken out at y k , and adjusting it by adding a linear combination of functions g &i for 
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j < k in order to exactly cancel the current through each ej, j < k. This is then the voltage function for 
G/ei, . . . , e k -\ when one unit of current is run across ir(e k ), and thus its difference across e k computes 
P(e k G T'). 

To do this formally, let for i = 1, . . . , k — 1 be real numbers for which M^ + E*^ 1 a i-^ij = 
for 1 < j < k — 1. These equations uniquely define the a* because the columns of M' are linearly 
independent and hence there is a unique linear combination of them summing to v where Vi = —M^. 
Let TV be the k by k matrix for which Nij = Mij for j < k and Nik = Mik + Y^jZi a j^ij'i m other 
words, the first k — 1 columns are used to zero the nondiagonal elements of the last column of M and 
N is the resulting matrix. Then detM = detTV = Nkkdet(Nj : i,j < k — 1) = TVfcfcdetM', whence 
Nkk — detM/ detM'. Now define a bounded element J of Vg by 

fe 

i=i 

with ctk = 1. We verify that 

(i) Forj<fc, Jfe) - J(y,) = 0; 

(ii) J(x fe ) - J( Wfc ) - dct(M)/dct(M'). 

Indeed, J{x j ) - J{y j ) = D^ 1 Ym=i a i9 Cz ( x j) ~ 9 e% {Vj) = D ~ X Ei=i &i DM ij so (*) follows from the 
definition of a, and (m) follows from the determination of N k k above. 

Now we have just shown that J is constant on the pre-image of any vertex under the contraction 
map 7r, and hence there is a well-defined function <p on the vertices of G for which J = o n. The excess 
of <p at a point z is just the sum over edges zw in G of <f>(z) — 4>(w). This is just the sum of J(u) — J(v) 
over edges m; for which ir(uv) = zw, which is just the sum of excess of J at u over u e tt^ 1 (z). The 
excess of J at u, (I — A)J(u), is just ^ Q!j(7 — A)Z? _1 (/ ei (u) which is just Z) -1 J^- ai(S Xi (u) — 5 yi {u)). 
This must be summed over 7r _1 (z), which, for i < k contains Xj if and only if it contains y^. Then the 
only possible nonzero contribution to the sum is otk(5 Xk (u) — S Vk (u)) and summing this over u G tt^ 1 (z) 
gives 1 if z = n(xk), —1 if z = n(yk) and zero otherwise. Thus (f> satisfies the conditions of Lemma 4.3 
and hence P(e fe e T | e u . . . , e k -i € T) = (f>(ir(x k )) - (f>{n{y k )) = J{x k ) - J(y k ) = dct(M)/ dct(M') by 
property (ii) above. This finishes the induction and the proof of the theorem. □ 

For ease of calculation, we derive a corollary to Theorem 4.2. 
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Corollary 4.4 With T, M and ei, . . . , e k as in Theorem 4-2, pick an integer r with < r < k. Define 
a k by k matrix M (r) by M {r \i,j) = M(i,j) if i > r and Sij - M(i,j) if i < r. Then P(ei, . . . , e r 
T, e r+1 ,...,e k € T) =det(M( r )). 

Proof: The assertion for r = is just the previous theorem; now assume for induction it is true for 
r — 1. By linearity of the determinant in each row of a matrix, we have det + det M^- 1 ) = det P 
where P(i,j) = M^ r \i,j) = M^ r ~ x \i, j) \ii^r and P(i,j) = 8ij if i = r. Expanding by minors along 
the r th row of P gives detP = det M( r_1 )(ei, . . . , e r _i, e r +i, . . . , e k ) which is equal to P(ei, . . . , e r _i ^ 
T, e r+ i, . . . , efe e T) by induction. Also by induction, det M (r_1 ' = P(ei, . . . , e r _i ^ T, e r , . . . , ek € T), 
whence by subtraction, det = P(ei, . . . , e r ^ T, e r +i, . . . , e k <E T), as desired. □ 

We end this section with a discussion of the case d = 0, or in other words, finite graphs. Let G be a 
connected aperiodic (i.e. non- bipartite) graph on the vertices S = {1, . . . , k}, where as usual self-edges 
have been added to make the graph D- regular. The transition matrix A has a simple eigenvalue of 1, 
so it is immediate that for any u such that J2i Ui = there is a solution g u e Vs to (I — A)g u = u 
and it is unique up to an additive constant. Defining the transfer impedance matrix by M(xy, zw) = 
g s *~ s v(z) — g s *~ s y(w) as before, Lemma 4.3 shows again that M(l,l) calculates P(ei € T) and the 
induction is completed as before, showing that P(ei, . . . , e k € T) = det M(ei, . . . ,e k )- Now add another 
self-edge to each vertex so the degrees are all D + 1 . If A' is the new transition matrix then I — A' = 
(1 - (D + - A) and hence the solution g ul to (I - A')g ul = u is just (1 + D~ 1 )g u . Thus the 

new transfer impedance matrix is the same as the old one, and hence the transfer impedance matrix is 
independent of the degree D at which we choose to equalize the loops. The electrical explanation for 
this is that M(e, /) is the induced voltage across / for a unit current with source x and sink y, where 
e = xy and every edge of G is a one ohm resistor. (To prove this just add self-edges to regularize the 
degree; this leaves M and the electrical properties unchanged and they now solve the same boundary 
value problem.) The random walk interpretation [DS] is that M(e, /) is the expected number of signed 
transits across / of SRW X stopped when it hits y. 

In particular, suppose G is a finite, connected graph but not necessarily having vertices of the 
same degree. We have seen that the transfer impedances for G may be unambiguously defined as the 
transfer impedances for any graph that extends G to a D-regular graph for some D by addition of 
self-edges. Write deg*(x) for the number edges incident to v that are not self-edges. Then deg* is 
invariant under degree equalization. The relevance of deg* to transfer impedances is that for many 
graphs M(e, /) is approximately equal to X^een/ deg^x)^ 1 . In other words, M(xTj, xy) is aproximately 
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deg*(x) 1 + deg*(y) , M(e,f) is approximately deg*{x) 1 if e and / intersect at x, and M(e, /) is 
approximately zero in other cases. 

To see why this should be true, consider the condition 

deg Sf (x) > 2e~ 1 and 

(i _ deg,(x) ^ p( S RW z hits x before y) < (1 + e)- de9 * {x) 



deg*{x) + deg*{y) deg*(x) + deg*{y) 

for all distinct x, y, z in some subset W of G. This says that a SRW from any point other than x or y in 
W will get "lost" with high probability before hitting x or y and will in fact end up choosing which to 
hit first in proportion to the stationary measures at the two vertices. There are many families of graphs 
Gk for which (10) holds with W = G for a sequence €k converging to zero as k — > oo; examples include 
the complete graph on k vertices and the fc-cube. 

Assume condition (10) for some e > 0. For some vertex x, all of whose neighbors arc in W, 
enumerate its neighbors y\ , . . . , ydeg, (x) (allow repeated neighbors if there are parallel edges). Let 
ei = xy i for i < deg*(x). Use the interpretation of M(a,ej) as the expected number of signed 
transits across ej for a random walk started at x and stopped at yi. Reversibility implies that the 
expected number of signed transits is zero over all times before the last visit to x, so conditioning on 
the first step of SRW X and using the "craps" principle shows that for fixed i and varying j, M(ei,ej) 
is proportional to P(SRW yj hits yi before x). Applying condition (10) together with the fact that 
P(SRW yi hits yi before x) = 1 shows that, when j ^ i 7 

(1 - e)deg*(yi)/(deg*(x) + deg*{yi)) 



<M{e l ,e j ) < 



1 + (deg*(x) - 1)(1 + e)deg*(yi)/(deg*(x) + deg^yi)) 
(1 + e)deg*(yi)/(deg*(x) + deg*(yi)) 



1 + {deg*(x) - 1)(1 - e)deg*(yi)/(deg*(x) +deg^(y l ))' 

Since deg*(x), deg*(y) > 2e _1 , deg*(x)deg*(y)/(deg*(x) + deg*{y)) > e _1 and the addition of 1 in the 
denominator of the first term in the above inequality loses no more than a factor of 1 — e, and we may 
rewrite the inequalities as 

(1 - e) 3 ^*^)- 1 < M(e i ,e j ) < (1 - e) _2 de0*(a:) _1 . 

Similarly, 

(1 - efideg^x)- 1 + deg^y,)- 1 ) < M(e,,e 4 ) < (1 - e^ideg^x)- 1 + deg^y,)- 1 ). 



19 



Finally, for e = xy and / = zw such that x,y,z,w and all the neighbors of x are in W, we have 
M(e,f) = M(e,e)[P(SRW z hits x before y) - P(SRW W hits x before y)] < 2e(l - e)- 2 (deff»(a;)- 1 + 
deg*(y)~ 1 ) . This follows from the electrical interpretation of M(e, /), since a unit current flow puts a 
voltage difference of M(e, e) across e, after which the voltages elsewhere are M(e, e) times the probability 
from there of SRW hitting x before y. Thus the mixing condition (10) does indeed imply that 

M(e,/)= ]T deg^x)- 1 +0(e)J2deg*(x)- 1 (11) 

xEenf xEe 

for x,y, z,w e W with all neighbors of a; inside W. 



5 An Example and a High Dimensional Limit 



The case where G is the nearest neighbor graph for Tl? is special because the Green's function can be 
explicitly evaluated as a polynomial in n^ 1 . Following [Spi], we have that the Green's function is given 

by 



H(0,x) = (2n)- 2 J — 



1 — cos(a; • a) 



da 



(l/2)cos(ai) - (l/2)cos(a 2 ) 
and for x = (n, n) a change of variables from (ai, 0:2) to (a\ + a 2 , ct\ — a 2 ) yields 

1 



i/((0,0),(n,n))-47r- 1 



-3 



In- 1 



These values, along with the symmetries of the lattice and the fact that H is harmonic, allow H to be 
determined recursively, the first few values being 
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4/t 



4- 8/tt 

G A- 



4/r 



4 -8/tt 



1 4/t 



1 4/t 



4- 8/tt 



1 4 - 8/tt 
4 o 



Let u>i, u>2, u>3, W4 denote respectively the edges connecting the origin to (1, 0), (0, 1), (— 1, 0), (0, — 1). 
The above values for H then yield the following circulant for M: 



M(wi, ...,w 4 ) = 



1/2 1/2-7T- 1 2tt- 1 -1/2 1/2-tt- 1 

1/2-7T- 1 1/2 1/2-tt- 1 2tt- 1 -1/2 

2tt- 1 -1/2 1/2-tt- 1 1/2 1/2-tt- 1 

1/2-tt- 1 2tt- 1 -1/2 1/2-tt- 1 1/2 



Theorem 4.2 and Corollary 4.4 assert for example, that P(w>i, W2, W3, W4 G T) and P(wi, w 2 , W3 ^ 
T, W4 e T) are given respectively by det M and det respectively, where 



1/2 -1/2 + TT- 1 -2tt- 1 + 1/2 -1/2 + tt- 1 

-1/2 + tt- 1 1/2 -1/2 + tt- 1 -2tt- 1 + 1/2 

-2tt- 1 + 1/2 -1/2 + tt- 1 1/2 -1/2 + TT" 1 

1/2-tt- 1 2tt- 1 -1/2 1/2-tt- 1 1/2 



The determinants of M and M (3) are respectively (4TT- 1 - 1)(2tt- 1 - l) 2 and 2tt~ 2 - 4tt- 3 , so the 
probability of all four edges incident to being in T is (4tt _1 — 1)(2tt- 1 — l) 2 w .0361, while the 
probability that the origin is a leaf of T (i.e. has degree one in T) is 8tt~ 2 — 16tt- 3 w .2945. 
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The remainder of this section corrects, proves and generalizes some conjectures of Aldous about 
spanning trees for graphs as the graphs tend to infinity locally, in the sense that the minimum number 
of neighbors of a vertex all grow without bound. We first quote Conjecture 11 from [A12]. To do this, 
let Gfc denote a sequence of finite graphs, each with a distinguished vertex v k - Let = deg*(vh), let Ak 
be the set of neighbors of Vk in Gk and for w e Ak, let ipk(w) = P(SRW^ k hits Ak \ {w} before Vk). 
Aldous then conjectures the following [A12, Conjecture 11]. 

Let 1 + Dk denote the random degree of Vk in the uniform random spanning tree on Gk ■ 
Suppose that 

r k -> oo;sup |r fe (l - ip k (w)) - 1| -> 0. 

w 

Then Dk converges in distribution to a Poisson with mean 1. 

Here is a counterexample to the conjecture. Let the vertices of Gk other than v k be {xi,yt, Zij : 1 < 
i < k; 1 < j < Ak}, with edges connecting Vk to each Xi and t/j and for every i, an edge connecting Xi to 
each and an edge connecting t/j to each Zij. Then r k = 2k. By symmetry, we have tpk{w) = ipk(xi) 
for any w G Ak- This is equal to P(SRW Xl hits y\ before Vk) = 2fc/(l + 2k), hence r k {l — %jjk(w)) = 
2k/ (2k + 1) — > 1 and the hypothesis in the conjecture is satisfied. But for each i, any spanning tree 
contains either an edge connecting Vk to X{ or an edge connecting Vk to y,, so the degree of Vk is at least 
k. 

Evidently, a condition different from rfc(l — ipk(w)) converging uniformly to 1 is required for the 
conjecture to be true. Aldous' condition is trying to capture two aspects of the graph: some sort of 
mixing (SRW from any neighbor of Vk returns to Vk before A with the same probability) and the correct 
total probabilities (these probabilities should all be about r^ 1 so they can sum to 1). The mixing part of 
the condition as stated in the conjecture is too weak, as illustrated by the counterexample, and needs to 
be replaced by a condition that equalizes the individual return probabilities of SRW W hitting Vk before 
z for any neighbors w, z of v k - The most natural such condition from our viewpoint is (10) with x = Vk 
and e = £fe for some {e^} going to zero. On the other hand, there is no need to require all the neighbors 
of Vk to have the same degree. Once a sufficient amount of independence has been achieved, it suffices 
for the expected number of such edges in the tree to be converging to a constant, 1 + A. The conjecture 
may thus be revised to yield the following theorem. 

Theorem 5.1 Let Gk be a sequence of finite graphs. Let — > be a sequence of positive numbers and 
let Vk,Ak,Dk and rk be as above. Assume that (11) holds with W — Ak and e = e^; this is implied for 
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example by (10). If in addition, J2 w eA k l/^ e .9*( w ) - > 1 + A as - >oo for some positive X, then Dk 
converges to a Poisson with mean X. 

Remark: Since the theorem gives local behavior at v k , it can easily be extended to the case where Gk 
are infinite graphs on which there is a Harnack principle: simply take G' k to be a large enough finite 
piece of Gk so that the hypotheses of the theorem are true (this is possible by the Harnack principle). 

The following lemma will be necessary when calculating determinants of transfer impedance matrices. 

Lemma 5.2 Let a\, . . . , ak+i be positive real constants and let e\, . . . be the edges of a spanning tree 
whose vertices are {1, . . . , k + 1}. Define a k by k matrix M by letting M(i,j) be a r + a s if i = j 
and ei connects r to s; a r if a and ej are distinct edges meeting at r; and zero otherwise. Then 
det M = 01 ar 1 ). 

Proof: If k = 1 then M = (a± + a 2 ) and the lemma is clearly true, now assume for induction that the 
lemma is true for k — 1. Assume by renumbering if necessary that the vertex 1 is a leaf of the tree, there 
being a single edge e\ connecting 1 to 2. Also assume that 2 is connected to 1, 3, 4, . . . , r by ei, . . . , e,._i 
respectively, for some r < k + 1. Row reduce M by subtracting 02/(01 + a 2 ) times the first row from 
rows 2, . . . , r. Since edges 2, . . . , r were the only edges incident to ei, this clears the first column. The 
remaining entries of M are unchanged except that all appearances of a 2 get replaced by aia 2 /(ai + a 2 ). 
Expanding along the new first column gives detM = {a\ + a 2 ) times the determinant of the k — 1 by 
k — 1 matrix gotten by taking all but the first row and column of M and replacing a 2 by aio 2 /(ai + a 2 ). 
By induction, the latter determinant is {a\/{ai + a2))(Ili> 2 a i)(( a i + a 2)/aifl2 + Si>3 a 7 1 )j so det M = 
dl fl i)(( a i + a 2 )/a 1 a 2 + Y,i>3 a 7 1 ) = H a i S a 7 1 as desired. □ 

Proof of Theorem 5.1: Let A be a random variable for which A — 1 is Poisson with mean A. Set 

4>{z) = P(X = n)z n = ze x{z - Y \ 

Then the s th factorial moment E(A) S d =EA(A - 1) • • • (A - s + 1) of A is the s th derivative of at 
1, which is equal to A s + sA s_1 . The factorial moments determine this distribution uniquely, since its 
moment generating function exists in a neighborhood of zero, from which it follows that convergence 
of the factorial moments of a sequence of random variables to the factorial moments of A implies 
convergence in distribution to A (see for example Theorem 3.10 of [Du]). It suffices therefore to prove 
that E(l + D k ) s — ► X s + sA s_1 as k — > 00 for each s. 
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Write E(l + Dk)s as P( e ij • ■ • , e s G T) where the sum is over all ordered collections (ei, . . . , e s ) of 
s distinct edges incident to Vk- Fixing such a collection, we have P(ei, . . . , e s £ T) = detM(ei, . . . , e s ). 
From (11) we have that M(i,j) = (1 + 0(ek)){deg*(x)~ 1 + 5ijdeg*(yi)~ 1 ) where = xy i . With s staying 
fixed, this and Lemma 5.2 give 

s s 

detM = (1 + o(l))([]de 5 »( U ; l )" 1 )(l + r^ 1 ^deg,(wi)). (12) 
i=l i=l 

Summing over all ordered collections gives 

s S— 1 

detM=(l + o(l))[ 5i n^K)^ 1 + 51 s^fc-sK'n^Kr 1 ] 

(ei,...,e s ) i=l (ei,...,e s _i) i=l 

since each ordered collection of size s — 1 appears s(rfe — s) times in the second term of (12) and each 
ordered collection of size s appears once in the first term of (12). As s remains fixed with the minimum 
degree among x and its neighbors converging to infinity, the above expression for det M converges to 
Ei deg^(w i )^ 1 ) s + s(^2 i deg^(wi)^ 1 ) s ^ 1 —* X s + sA s_1 , proving the convergence of factorial moments 
and the theorem. □ 

Suppose now that we are interested not only in the degree of Vk but in the local structure of the 
essential spanning forest near Vk- For any locally finite rooted tree T, let T A r denote the random finite 
subtree of T consisting of vertices connected to the root by paths of length at most r in T. We say that 
a sequence of tree- valued random variables converges in distribution (written — >T) if A r — >T A r 
for every r, where the latter is defined to hold when P(Tfe A r = t) — > P(T A r = t) for every t of height 
at most r. Under suitable conditions on the graphs Gk, it will turn out that the component of the 
uniform essential spanning forest on Gk rooted at Vk will converge in distribution to a particular tree 
V\ which we now define. Let V\ be a singly infinite path, xq, Xi, . . ., to which has been added at each Xi 
the tree of an independent Poisson (1) branching process (which is critical hence finite with probability 
one). Another way of describing T is as the tree of a Poisson (1) branching process rooted at x and 
conditioned to survive forever. Aldous has conjectured (personal communication, though the conjecture 
is implicit in [A13]) that converges in distribution to V\ whenever Gk grows locally in a sufficiently 
regular manner. In the terminology of [All], should converge to a sin-tree with the fringe distribution 
of a Poisson (1) branching process. This is known in the special case where Gk = Kk, the complete 
graph on k vertices [A13, Gr]. 

We are now in a position to prove this. A consequence is that the probability of there existing two 
disjoint paths of length Lk in from Vk goes to zero as k — > oo, provided that Lk — > oo. An question 
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left open in [Pern] is whether the components of the uniform essential spanning forest have one or two 
ends (the possibility of more than two is ruled out by an argument in [BK1]). We believe the answer 
to be that all components have one end, and convergence to zero of the probability of there being two 
disjoint infinite paths from v k can be viewed as a heuristic argument in favor of all components having 
one end. 

Theorem 5.3 Let G k be a sequence of finite graphs with distinguished vertices v k - Assume, by renum- 
bering if necessary, that 

(1 + o(l)) maxdeg*(x) = k = (1 + o(l)) min deg* (x) 

x£G xeG 

as k — > 00. Also assume the following version of (11) uniformly in edges e, / € G: 

M(e,/) = fc- 1 (|en/|+o(l)). 

Then T k ^Vi . 

Remark: For r=l, TArisa star centered at v k with 1 + D k edges, while V\ A r is a star centered at 
x with 1 + X edges, where X is Poisson with mean 1. Thus the case r = 1 is essentially the previous 
theorem. Notice also that the usual families of graphs G k (e.g. complete graph on k vertices, fc-cube, 
fc/2-dimensional torus of arbitrary length) all satisfy the hypotheses of the theorem. 

Proof: For a finite rooted tree t and finite rooted graph u, say that a map / from the vertices of t to 
the vertices of u is a tree-map if / is injective, maps the root of t to the root of u, and f(x) <~ f(y) for 
each x <~ y. Let N(u; t) denote the number of distinct tree maps from t to u. For example, if t and u 
are stars of respective sizes s and r about their roots, then N(u;t) = (r) s . The proof of this theorem 
generalizes the proof of the preceding theorem, in the sense that ~EN(T k ;t) is a sort of generalized t th 
moment of T k . In the appendix, the usual tightness criteria for convergence of probability measures are 
extended in an obvious way to tree-valued random variables, showing in particular (Theorem 8.7) that 
if EN(T k ;t) — » EN(T;t) for each t and the values of EN(T;t) uniquely determine the distribution of 
T then T^ — >T. Also proved there is the somewhat less trivial fact (Theorem 8.8) that the values of 
~EN(T;t) uniquely determine the distribution of T under the growth condition: EN(T;t) < e c l'l for 
some c. (A sharper growth condition such as an analogue to Carleman's condition could be obtained 
but is not needed here.) What remains then, is to show that EN(Tk]t) — > EN(Vi;t) for each finite t 
and to verify the growth condition on EiY("Pi;i). 
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Begin by establishing 



For any finite rooted tree t, EN(U ; t) = 1, 



(13) 



where U is the tree of a Poisson (1) branching process rooted at some vertex j/o- Let zq be the root 
of t. Use induction on the height of t. If t is just z , then N(u;t) = 1 for any u, so the equation is 
trivially true. Now suppose z has s descendants Z\,...,z s for some s > and assume for induction 
that (13) holds for each of the subtrees ti, 1 < i < s rooted at z,. Let r > be the random number of 
descendants yi, . . . ,y r of yo. Conditional upon r, each of the subtrees Ui rooted at yi is the tree of an 
independent Poisson (1) branching process. Now any tree-map from t to U maps each ti to a distinct 
Uj. Thus N(U; t) = ^ E[j=i ^( u kj ', tj) where the sum is over all ordered sequences of distinct k\, . . . , k s 
chosen from among 1, . . . , r. Then by independence conditional on r, 



By the induction hypothesis each expectation is one, so the sum is just the number of ways of choosing 
the kj's. Thus EN(U;t) = Z) r (e~ 1 /'"!)(r) iS = E(X) S where X is a Poisson of mean 1. This is equal to 
1, establishing (13). 

Now we compute EN(Vi;t). Recall that V\ is a path Xi : i > to which has been added an 
independent Poisson (1) branching process, say Ui at each Xi. If / is a tree-map from t into Vi, there is 
a greatest i for which Xi is in the range of /. Let w(f) denote the vertex of t that maps to Xi for this 
greatest i. For each vertex z of t, we will show that the expected number of tree-maps / : t — > V\ for 
which w(f) — z is one. Indeed, if x = z , z\, . . . , z/, — z is the path from the root of t to z, then the 
tree-maps / for which w(f) = z are in one to one correspondence with the collections of maps fo, ■ ■ ■ , fk 
where /j maps the subtree ti of t rooted at Zi to the subtree of V\ rooted at Xi. Thus the number 

of / for which w(f) — z is Yii=o -^(^M*); But each 'Pi(i) is an independent Poisson (1) branching 
process, so by equation (13), the product is one. Finally, summing over the vertices z of t gives that 
EN(Vi;t) is equal to \t\, the number of vertices of t. This verifies the growth condition on EA^Pi; t) 
with miles to spare! 

To calculate EA r (T^; t), observe that any tree-map / : t — > T is also a tree-map from t to Gk where 
Gk is considered to be rooted at Vk- If u(f) denotes the image of / as a subtree of Gk, then we may 
write 



where the sum is over all tree-maps / : t — ► Gk- Fix /. Then P(u(f) C Tfe) = detM where M is the 
transfer impedance matrix for u(f) as a subgraph of G. By hypothesis M(e, e') = fc _1 (|e n e'| + o(l)). 



s 



EN(U; t) = 5>-7r!) ^ J] EiV (u fej ;t 3 ). 



r j = l 




(14) 
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Since / is injective, / _1 is defined on u(f) so |e n e'\ — |/ _1 (e) n / _1 (e')|, thus for any / the transfer 
impedance matrix for u(f) in Gk may be written as fc _1 P, where P is a matrix indexed by the edges of 
t for which P(e, e') = (\e(le'\ +o(l)) as fc — ► oo. Since the size of P remains fixed as k — > oo, Lemma 5.2 
with Oj = 1 for all i gives that detP is equal to the |f|(l + o(l)), and thus detM = fc 1_ l*l|f|(l + o(l)). 
Then the probabilities in equation (14) are all equal and the identity becomes 

EN(T k ;t) = N(G k ,t)k 1 -^\t\(l + o(l)). 

But N(Gk, t) = fcl*l _1 (l + o(l)). An easy way to see this is to consider building a tree-map / : t —* Gk 
starting at the root and working outwards, not worrying about injectivity. If / is defined on z then 
by hypothesis there are fc(l + o(l)) neighbors of f(z) and each descendant of z may be mapped to any 
neighbor of f(z). The fraction of all maps built this way that are injective goes to one as t remains fixed 
and k — > oo, so the total number of maps is fcl*l- 1 (l + o(l)). Now (14) becomes EN(T k -t) = |i|(l + o(l)), 
hence EA r (T^;t) — > EN(Vi;t) as claimed. Since the growth condition on EN(Vi;t) has been verified, 
this shows Tfc —*V\ . □ 



6 Entropy 

In this section we consider the entropy of the essential spanning forest process. 

The set of essential spanning forests is a closed shift- invariant subset of {0, 1} S ( G ) where E(G) is the 
edge set of G. The topological entropy (per vertex) of the essential spanning forest is defined to be 

H top = lim -r^-rlog(N B J 

rwoo \B n \ 

where B n is an increasing sequence of rectangular boxes (i.e. of the form C x S where C is a rectangular 
box in % d together with the induced edges) and Nb is the number of essential spanning forests of the 
induced graph B where a forest is essential if every component of the graph is required to touch the 
boundary of B. We may also consider boxes with boundary conditions, meaning a box B together with 
an equivalence relation = on the vertices of B that neighbor B c . An essential spanning forest on a box 
with boundary conditions is one that becomes a tree under the contraction map consisting B ^ Bj = 
(think of the boundary conditions as telling which vertices are connected by unseen edges in B c ). 

Notice that if B is the union of two boxes C and D then any essential spanning forest of B restricts to 
essential spanning forests in both C and D. This means that N B < NcN D so that \og(N B ) is subadditive 
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and the entropy is independent of the sequence of boxes chosen. There is a variational principle for the 
topological entropy of the essential spanning forest. 

Htop = sup{i?(/x) | /i is an invariant probability measure } 

where H (/x) is the Kolmogorov-Sinai entropy of /i per vertex with respect to the group of translations 
by TL d . See [Mis] for a short proof of this fact. 

Now let hg be the probability measure of uniform essential spanning forests on G. Using arguments 
described in [Pern] it is seen that if B n is any increasing sequence of rectangular boxes with arbitrary 
boundary conditions and if ji n is the measure that gives equal weight to each forest in B n in which each 
component of the forest meets the boundary of B n the fx n converges weakly to the translation invariant 
probability measure moreover this convergence is uniform in the boundary conditions. By uniform 
convergence we mean the following. Suppose that we are given a cylinder set and an e > 0. Then there 
is a box B, containing the cylinder, so that for any box C containing B and any boundary conditions 
on C we have that the uniform probability measure on essential spanning forests of C takes a value on 
the cylinder set that agrees with that of fia to within e. 

The principle content of this section is the following theorem. 

Theorem 6.1 (a) The measure fie of the uniform essential spanning forest process is the unique 
translation-invariant measure on the set of essential spanning forests whose Komogorov-Sinai entropy is 



where x{Q{ a )) * s the characteristic polynomial of Q(a) and the integral is over the d-torus with respect 
to Haar measure. 

Before proving the theorem we give some examples of (b) in which the entropy can essentially be 
read off. The case when S = {1}, i.e. k = 1 is especially easy to analyze because Q{a) is a 1 x 1 
matrix. In these cases the entropies are the same as entropies calculated by Lind et al [LSW] of some 
seemingly unrelated dynamical systems that can be represented as Bernoulli shifts on certain subgroups 
of (% d )^/'^') defined by peroidic linear relations. We are at a loss to explain this apparent coincidence. 

Suppose the origin is connected to M pairs of opposite vertices in G = "E d . Say the number of 
self-edges per vertex is I, though clearly this must drop out of the calculation. Suppose we denote 



top- 



(b) 
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representatives of these pairs by {x m : 1 < m < M}. Then D = 2M + I and 

M 

Q(a) = D~ x (l + ^2 2 cos(27ra • x m )). 
Then x(Q(a))(l) = 1 - Q{a) = D^(^i 2 - 2cos(27ra • x m )) and the entropy is 

. M 

/ log(2M - V 2 cos(2ttq: • aj m ))da. 

- 1 m— 1 

One such example is itself with nearest neighbor edge relation. The entropy is 

[ [ log(4 - 2 cos(27rai) - 2 cos(27ra 2 ))daida 2 w 1.166. 
Jo Jo 

Another is the triangle lattice. This has a representation as the nearest neighbor lattice on H? with 
added edges placed in the "southwest - northeast" diagonal of each square. 





In this case, ignoring self-edges, Q(a) = g(2 cos(27rai) + 2cos(27ra 2 ) + 2cos(27r(ai + a 2 ))) and the 
entropy is 

/ / log(6 — 2cos(27rai) — 2cos(2-7tq; 2 ) — 2cos(27r(ai + a 2 )))do:i(ia 2 w 1.61. 
Jo Jo 

These are the same as the entropies given in [LSW] for Haar measure on the subgroups of (R/S)^ 
consisting respectively of those configurations 4> for which 4(j)(x) — <f>(x + (0, 1)) — cf){x + (0, —1)) — 4>(x + 
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(1,0)) — 4>(x + (— 1, 0)) = and those configurations <ft f° r which 6</>(x) — <p(x + (0, 1)) — 4>{x + (0, —1)) — 
<f>(x + (1,0)) - <f>{x + (-1,0)) - cj>{x + (1, 1)) - <j>(x + (-1, -1)) = 0. 

The proof of Theorem 6.1 uses the following lemma on the stability of entropy under changes in a 
small percentage of the output of the process. 



Lemma 6.2 Let (fi,/i) be a Lebesgue probability space. Suppose that X — (X\ 1 X2, ■ ■ ■ ,Xjy) and Y = 
(Yi, Y2, . . . , Yn) are binary random variables such that for all u> € ft, jj={i\Xi 7^ Yi\ < K. Then 



\±H(X)-±H(Y)\<Klog(N)/N 

Proof: Let Z, t = l {Xi ^Y i }- Then H(X) < H(X,Z) = H(Y,Z) < H(Y) + H{Z). By symmetry we see 
that \H(X) - H(Y)\ < H(Z). But by counting H(Z) < log(N K ) proving the lemma. □ 

Proof of Theorem 6.1: Let B n have arbitrary boundary conditions and let B n have the same vertex set 
but with unconnected boundary conditions, i.e. the equivalence relation consists of singletons. If jl n 
gives equal probability to each spanning forest of B n in which each component touches the boundary 
then let v n be the measure concentrated on spanning trees of G obtained as follows. First partition the 
vertex set of G with translates of B n and put independent copies of \i n on each of these translates of B n . 
Then add (at most 0(n d_1 )) edges in each translate of B n to make a tree that spans this translate. Then 
connect each of these trees in the translates by a translation-invariant path similar to those constructed 
in [BK2] . This random procedure produces a random spanning tree on G with two ends whose measure 
is denoted v n . It may be made translation- invariant by averaging the distribution over all shifts by % d 
shifts in B n . By the lemma above we see that for each n 

1 tf(^)<l^) + 0(^ 



Likewise it is seen that 



lim "TETT^^™! B n) = H top 



where \i n is the uniform measure on spanning forests of B n in which every component touches the 
boundary. A similar argument together with the subadditivity of entropy gives us that H(^ig) — H top . 

Further, if \i is any ergodic translation-invariant probability with H(p) — H top then we can show 
that fi — fi G . Fix a rectangular box B. Consider a much larger C around B. Condition on the /i-outside 
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of C and record the boundary condition x = y iff x and y are connected by a path in C c . Since /i 
has maximal entropy the conditional distribution of fj, on B is the same as the distribution of uniform 
essential spanning forests with these boundary conditions. (Were this not true we would be able to 
modify [i within such boundaries and force the entropy to be strictly larger.) If the outside box is large 
enough then we see (again using the arguments in [Pern] ) that the conditional distribution of /j, on 
B is very close to the distribution of \iq on B uniformly in the boundary conditions. Integrating this 
/^-conditional distribution with respect to the /i-outside of the large box gives us that /i-distribution on 
B is very close to the /zc-distribution on B. Taking limits gives /i = jio- This proves part (a) of the 
Theorem and leaves only the computation in (b). 

Now we consider finite subgraphs with periodic boundary conditions. Consider the toral graph 
(S„) d = {1, 2, n} d with nearest neighbor relation taken modulo n. Our vertex set will be (S„) d x S 
with incidence matrix 

M n ((x,i),(y,j))=Ry- x (i,j) 
where the y — x is taken mod n and n is assumed to be large enough that |ar| > n implies R x = 0. 

Let Nb = the number of spanning trees on this graph. We have shown above that H top = lim —j— log( Nb )• 

n a k 

To complete the proof of the Theorem we use the Matrix Tree Theorem to compute Ns n - This 
theorem ( [CDS, page 38] ) says that if we have a D-regular connected graph with L vertices and if the 
eigenvalues of the incidence matrix are A l7 A 2 , . . . , Xl-i,^l = D then the number of spanning trees of 
the graph is 

L-l 

So it is enough to compute the eigenvalues of the matrix M n . Given a = (—,..., — ) for aj € TL n 
(which we may view as an element of T d ) suppose that X(a) is an eigenvalue of Q(a) with eigenvector 
v(a). Then DX(a) is an eigenvalue of M n with eigenvector v(a) ® £ Q . 

This is checked analogously with Lemma 2.1. 

^M„((x,i), (y,j))v(a)j cxp(27ria • y) 

3,V 

= ^ R v ^ x (i 1 j) exp(27r?a • (y — x))v(a)j exp(27ria • x) 
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DQ(a)i.jv(a)j cxp(2iria ■ x) 



Now D ■ Q(a) is Hermitian, in particular it eigenvectors span and the characters exp(27ria • x) 
span C " we see that we have found a complete contingent of eigenvalues. 



A, a 

= ifn^iK 1 -^))^- 1 ) (na-Mo))] 

a^O A^l 

Continuing and ignoring some logarithmically insignificant terms we get 

Htop = 1 T^ l0g( ^ B " ) 



log( J D fe XQ( a) (l))da. 



1 

fc 

The last equality follows from approximating the integral by Riemann sums. Because of the estimate 
of the eigenvalues from Lemma 2.4 we see that |xQ(a)(l)l < K\a\~ 2k so that the Lebesgue integral is 
finite. On |a| > e all the terms in the sum for all n are bounded, so by bounded convergence, the 
summations (qua integrals of step functions) converge to the integral. On < \a\ < e, all sums and the 
integral go to zero as e — > 0, implying the desired convergence. □ 



7 Dominoes 



A domino tiling for a graph G, otherwise known as a perfect matching or a 1-factor, is a collection of 
edges of G the disjoint union of whose vertices is V(G), the vertex set of G. There is a correspondence 
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between spanning trees of a planar graph and domino tilings of a related graph which we now describe. 
Let G be a nice planar graph with vertex set Vg and edge set Eg, "nice" meaning here that every vertex 
has finite degree and every face including the exterior face is bounded by finitely many edges. Since G is 
planar there is a dual graph G* with vertices Vg* and edges Eg*- In rough terms, G* is obtained from 
G by putting a vertex at each face of G and joining two such vertices by an edge if their corresponding 
faces in G meet at an edge. The set Vg* is identified with the faces of G and the edge sets Eg and Eg* 
are also identified. We construct a new bipartite graph G whose vertex set is the union of Vg, Vg* and 
Eg- There is an edge of G joining v € Vg and e € Eg if and only if e is incident to v. Likewise v € Vg* 
and e £ Eg are joined by an edge if v is a vertex of the edge in G* identified with e. Both G and G* sit 
inside of G in the sense that G = G/Eg — Vg* and G* = G/Eg — Vg (recall this notation from Section 2 
for contraction and deletion of a graph). Here is an illustration of this where G is the triangular lattice 
(vertices are filled circles), G* is its hexagonal dual (open circles) and the extra vertices of G are the 
crossing points. 



There is a natural correspondance between subgraphs of G and subgraphs of G* . If T is any subgraph 
of G let T* be the subgraph of G* obtained by declaring an edge e* in T* if and only if the corresponding 
edge e is not in T. Clearly this is also a dual operation so T** — T. We record an easy lemma on dual 
trees. 

Lemma 7.1 (a) Let G be a finite planar graph. Then T* is a spanning tree of G* if and only if T is a 
spanning tree of G. 
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(b) Let G be an infinite planar graph all of whose faces are bounded regions. Then T is a one ended 
spanning tree if and only if T* is a one-ended spanning tree. Also T is an essential spanning forest if 
and only if T* is an essential spanning forest. □ 



Remark: This lemma and its soon to be described connection with domino tilings were noticed inde- 
pendently by Jim Propp who suggests the name "Temperleyan" for graphs that are isomorphic to G 
for some G. Temperley [Tern] first used this trick in the case that G was H? . We also learned of this 
independently from Piet Kastelyn (personal communication). 

Define a directed essential spanning forest to be a spanning forest together with a choice of an end 
for each component. Think of edges of a directed spanning forest being oriented toward this end. If T 
and T* are dual essential spanning forests of a nice infinite planar graph, say the pair (T, T*) is directed 
if an end has been chosen of each component of T and of T* . For any nice infinite planar graph G, we 
now describe a bijection between domino tilings of G and directed pairs of essential spanning forests of 
G and G*. 

If (T, T*) is such a pair, then let *(T,T*) be the domino tiling A C E(G) such that 

(i) the edge from v £ V(G) to e e E(G) is in A if and only if e £ T and is oriented away 
from v, and 

(ii) the edge from v* £ V(G*) to e £ E(G*) = E(G) is in A if and only if e £ T* and is 
oriented away from v* . 

It is easy to verify that A is a domino tiling: each vertex v £ V(G) is in precisely one edge of A, 
corresponding to the unique edge in T out of v; similarly each v* £ V(G*) is in a unique edge of A; and 
each e £ E(G) is in a unique edge of A since e is in precisely one of T, T*. Conversely, If A C E(G) is a 
domino tiling, then each edge / £ A connects some e £ E(G) either to some v £ V(G) or some v* £ G* . 
Let $(/) be the edge e in either G or G* accordingly and orient it away from » or w*. Then the collection 
of all {$(/) : f £ A} is the union of a subgraph G' of G and the corresponding dual subgraph G'* of 
G*. If G' has a loop, then inside the loop is a conponent of G'* . Starting anywhere in this component 
of G"* and following the orientation creates a loop since the component is finite. This loop encloses a 
loop of G' inside the original loop. But this cannot continue forever, whence G' (and G'*) has no loop. 
Thus G' and G'* are essential spanning forests with each component directed toward an end. 
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Write II for the map that takes a directed pair of ESF's (T, T*), and forgets about T* and about the 
arrows, producing the undirected ESF T. Then we have established a correspondence 

$ n 

DOMINO TILINGS ^ DIRECTED ESF's — ► ESF's. 

Now the projection II from directed dual pairs of essential spanning forests to essential spanning 
forests that forgets the orientation and T* is not in general one to one, but by Lemma 7.1, if T is a 
one-ended essential spanning tree of a nice planar graph, then so is T* , so there is no choice to be made 
in orienting the components of T and T*. Now fix a S 2 -periodic planar graph G, so the vertex set is 
H? x S where S = {1, . . . , k}. There is then a well defined map "J" o n^ 1 from one-ended spanning trees 
of G to domino tilings of G (which is also Z 2 periodic). The uniform spanning forest measure, v on G is 
supported on the set of one-ended trees [Pern] so the above correspondence gives a transported measure 
v on domino tilings of G. 

Theorem 7.2 The measure v defined above is the unique measure of maximal entropy amongst all shift 
invariant probability measures on domino tilings and its entropy per vertex is kH{v)/2e where H(u) is 
the entropy of v and e is the number of edges per fundamental domain. 

Remark: This theorem works because boundary conditions are, as we have seen in the previous section, 
irrelevant for trees. Since different boundary conditions for domino tilings give different entropies [Elk, 
Kas, TF], the excursion through trees is the only soft method we know to get uniqueness of the maximal 
entropy measure for domino tilings of Temperleyan graphs. Schmidt [Sch, page 58] cites an argument 
by Kuperberg that is supposed to prove this for r E? . 

Proof: We have seen that v is well defined and it is evidently S 2 -invariant, so it remains to prove 
the assertions about its entropy. Suppose that ft is a translation invariant probability measure on 
domino tilings of G. This may be transported to a measure /U on essential spanning forests of G by 
/j,(B) = /i[\I>[II _1 [£?]]]. We show that the entropy per fundamental domain is preserved. First, note that 
H is translation invariant so with probability one the components of the essential spanning forest have 
one or two ends [BK1]. There is only one way that a one-ended tree may be paved with dominoes and 
there are two ways a two-ended tree may be paved. Thus the ambiguity in determining the domino tiling 
is one bit for component two-ended tree in the forest in G plus one bit for each two-ended component 
in the dual spanning forest of G* . Since there are 0(n) such components in every box of side length n 
which has on the order of n 2 vertices we see that the entropy of \i and jj, are the same. 
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Now H(fi) per fundamental domain = H(p-) < H{v) = H(i>) per fundamental domain with equality 
only when fj, = v . But v is concentrated on one-ended spanning trees [Pern] and hence v is the only 
measure which transports to v, which establishes that v is the unique measure of maximal entropy on 
domino tilings. 

Finally, recall k is the number of vertices of G in each fundamental domain. Let e be the number of 
edges and / the number of faces, in the sense that a box {1, . . . , n} d x {1, . . . , k} will have approximately 
(1 + o(l))fn d faces completely contained in it as n — > oo. Euler's formula applied asymptotically says 
that k + f = e. The entropy of the domino process on G is the same as the entropy of the spanning tree 
process on G (and as the spanning tree process on G* ) when measured per fundamental domain. We 
have given the entropy formula for the spanning tree process per vertex. To convert this to the entropy 
of the domino process on G per vertex we must multiply by k/(k + e + /) = kj (2e). □ 

Corollary 7.3 There is a unique measure of maximal entropy on domino tilings of the 2-lattice Z 2 . 
and its entropy is 1/4 the entropy of the spanning tree process on Z 2 . 

Remark: This entropy number was first calculated by Kastelyn [Kas] in 1961 as the exponential growth 
rate of the number of tilings of large rectangle or torus. Since these are atypical boundary conditions 
this does not necessarily prove that this is the largest entropy possible. 

Proof: In the theorem take G = Z 2 so that G* is isomorphic to Z 2 and G is also isomorphic to Z 2 . 
The uniform spanning tree measure v on G induces a measure v on domino tilings of G that has a 
fundamental domain of four vertices. The measure on G is invariant under the induced TZ? action; since 
there a four vertices of G in a fundamental domain of G, this is a subgroup of index 4 in the usual group 
of translations of G. Actually, though it must be invariant under all graph automorphisms a including 
90° rotations, since otherwise v o a would be a measure on domino tilings distinct from v but with the 
same entropy, violating the uniqueness shown in Theorem 7.2. □ 

The fact that II is not in general continuous leads to a problem in trying to compute f.d.m.'s of 
i>. If C = {T : ei, . . . , e/c G T} is a cylinder event in the space of essential spanning forests, then 
*[II _1 [C]] is a finite union of cylinder events in the space of domino configurations. On the other 
hand, if C = {A : e\, . . . , € A} is a cylinder event in the space of domino configurations, then C 
is not necessarily VPfll - 1 [C]] for some elementary cylinder event C on essential spanning forests. Thus 
knowledge of the f.d.m.'s of v would yield the f.d.m.'s of v quite directly, but unfortunately not vice 
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versa. To illustrate this, let C be the event of finding a square of two vertical dominos with the origin 
at the lower left corner. 



vertices of G are open circles 
vertices of G* are filled circles 



Then $[C] is the event that there is an oriented edge from upward from the origin in T and a dual 
edge in T* oriented downward on the right of the origin. 
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The corresponding event on trees is that the edge upward from the origin be in T, that the path from 
the origin to infinity be through that edge, that the edge leading right from the origin not be in T and 
that the path connecting the origin and the point to the right go over the top, rather than around the 
bottom (speaking homotopically in the plane minus the edge leading right from the origin). We do not 
know how to compute this probability. There are however some cylinder domino events corresponding 
to events whose probabilities we do know how to compute. Here is one example. 

Consider the following contour which can be broken down into dominos in the four ways shown. 
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We may calulate the probability of finding this contour with the origin at the bottom left. Since v is 
uniform on the interior of any box given the boundary, each of the four configurations inside then has 
1/4 this probability. To carry this out, map by $ so as to get the following four configurations of directed 
edges. 



A x 



Here, A\ is the event of there being oriented edges in T leading up out of the origin, right, and back 
down, while there also being a downwardly directed edge in T* on the right of the origin. Since this 
dual edge is implied by the the other three, it is not shown. A similar thing happens with A2,A$ and 
A4. Now A\ U A 2 U A 3 U A4 is the event of T containing three out of the four edges of the square with 
the origin at its lower left, and having the path from this square to infinity exit the square at the lower 
right. Then P(A) = P(A) = P(Ai U4 2 Ui 3 U A 4 ) which is by symmetry just 1/4 times the probability 
of T containing three of these four edges, with no specified orientation 
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which is just 1/4 the probability of the vertex in the center of the square being a leaf of T*, which is 
2ir- 2 - 4ir- 3 w .0736 from Section 5. 



The examples which work out this nicely are a small finite class. There is another class of examples 
of f.d.m.'s we can calculate. We give just one illustration, since the taxonomy is still being worked out. 
Consider the following pair of dominos and corresponding set of oriented edges of T. 



The probability of these two oriented edges is the probability of T containing the two unoriented edges 
times the conditional probability given that of the path to infinity leaving through the point at the upper 
right. The first probability is computed by transfer impedances to be 

1/2 1/2-1/tt _! _ 2 

= TT — TT 

1/2-1/tt 1/2 

The conditional probability may be seen to be the probability that a random walk coming in from 
infinity hits the set of three vertices first at the upper right. The hitting distribution from infinity on a 
set of vertices x\, . . . , Xk is proportional to the k entries of (1, ... , 1)M _1 where M is the Green's matrix 
i.e. My = H(xi,Xj) (proof: use a last exit decomposition from {xi, . . . ,x k } and then invert the linear 
relations). Then the conditional probability in question is tt/(6tt — 8), which gives a total probability of 
(l-7r- 1 )/(67r-8) w .0628. 



39 



8 Appendix 



8.1 The classical Green's function 



Let H(x,y) be the classical Green's function for G defined by 



oo 



H{x,y) = Y,'P{SRW x {n)=y) 



(15) 



when d = 3, and 



DC 



H(x,y) = Y,lP(SRW x (n) = y) - P{SRW x (n) = x)] 



(16) 



n=0 



when d = 1 or 2. 

Theorem 8.1 The sums in (15) and (16) converge. Furthermore, H has the following properties: 

(i) (H) is symmetric; 

(ii) H(x, •) is harmonic except at x, where its excess is 1; 
(Hi) H is bounded if d > 3; 

(iv) H(x, •) — H (y, •) is bounded for fixed x,y if d < 2. 
Remark: Theorem 4.1 follows immediately from this and Corollary 3.3. 

Proof: Begin with the observation that SRW is transient if d > 3 and recurrent if d < 2; there are 
many ways, to see this, one being to watch SRW G only at the times when it hits S d x {1} which 
is then a symmetric random walk on Wi d with P(x, y) having exponential tails. When d > 3, the 
theorem is now easy to prove. The sum converges by definition of transience. Writing P(SRW x (n) = y) 
as the sum of D~ n over paths of length n from x to y shows by path reversal that this is equal to 
P(SRW y (n) = x) for each n, hence H is symmetric. Boundedness follows from the fact that H(x,y) = 
P(SRW X hits y)H(y,y) < H(y,y), and from the fact that H(y,y) takes on only k different values. 

Assume now that d < 2. Since SRW G is recurrent there is a a-finite stationary distribution /j,, unique 
up to constant multiple [IM]. It is easy to see that this is uniform. Furthermore, it is well-known [IM] 
that for any x,y,z G G, the ratio of Cesaro averages converges: 




(17) 
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as N — > oo. Now fix x,y,z and consider the Markov chain {Z{n) : n > 1} on the space {x,y} gotten 
by looking at SRW Z only when it at x or y. In other words, Z(n) = x if the n* 71 visit of S'iiW^ 

to {x,y} is at x and Z(n) = y otherwise. The transition matrix for Z is ^ ^ ^ ^, where 

a = P(SRW X hits x before y) and b = P(SRW y hits y before x). It follows easily from (17) that the 
stationary distribution for Z must be half at x and half at y, from which it follows that a = b in the 
transition matrix. 

It is easy to calculate 

oo 

J2 \P(Z{n) = x)- P(Z(n) = y)] = [P(Z(N) = x) - P(Z(N) = y)}/(2 - 2a). (18) 

n=N 

Now for any positive integers L < M, we have 

M 

J2mZ(n)=x)-P(Z(n) = y)} 

n=l 
L 



= 5^[P(Si2W z (n) = s) - P(Si2W z (n) - y)] 



n=0 

T 

+E E( ^ I(SRW z (n) =x)- I{SRW z {n) = y) \ SRW Z {L + 1)) 

n=L + l 

where r is the time of the M th visit to {x,y}, and letting M — > oo while using (18) gives 

L 

J2[P(SRW z (n) =x)- P(SRW z (n) = y)] 

n=0 

= [2P '{SRW Z hits x before y) - l)/(2 - 2a) 

-tP(Si2W z (ri) = a:) - P(5iiW,(r L ) - y)]/(2 - 2a), 

where rj, is the first time after L that S-RW^ hits {x,y}. The last term is converging to zero as L — > oo, 
hence letting z = a;, the sum in (16) converges. Moreover, when z = x the sum converges to 1/(2 — 2a), 
and having shown that a = b in the transition matrix, we see that this is symmetric in x and y, 
proving (i). Along with the relation P(SRW x (n) = y) = P(SRW y (n) = x), this also establishes that 
Y,n=oi P ( SRW *( n ) = x ) - P(SRW y {n) = y)] = 0. From the fact that P(SRW x (n) = w) -» for any w 
and from the relation 

N N-l 

J2[V(SRW x (n) = y)~P{SRW x {n) = x)} = S^ + D' 1 ]T ]T [P(SRW x (n) = z)-P(SRW x (n) = a;)] 

n— z^y n—0 



41 



it now follows that H{x, ■) is harmonic except at x and has excess 1 at x. 

Finally, to check that H(x, •) — H(y, •) is bounded for fixed x and y, use ^2^ =0 [P (S RW x (n) = x) — 
P(SRW y (n) = y)} = to conclude that H(x, z) - H(y, z) = J2n=oi P ( SRW z( n ) = x) - P(SRW z (n) = 
y)\. This is just (2P(SRW Z hits x before y) — l)/(2 — 2a), and the numerator is bounded between — 1 
and 1, which proves that H(x, ■) — H(y, •) is bounded. □ 

8.2 Harnack lemmas 

Lemma 3.1 is developed in [Lai] through a series of theorems beginning with a local central limit theorem 
for SRW on We first remark that [Lai, Theorem 1.2.1] actually holds for the following more general 
random walk. Let {X n : n > 0} be an irreducible aperiodic random walk on TL d with symmetric 
transition probabilities (i.e. P(x,x + a) = P(x,x — a)) that decay exponentially (i.e. P(x,x + a) = 
0(e~ c l°l)). Then the characteristic function for X n is still given by <j)(0) n where (j)(9) is real and equal 
to 1— < 8,9 > +0(< 8,9 > 2 ) near zero for some positive definite form <, >. Then the proof of [Lai, 
Theorem 1.2.1] gives 

Theorem 8.2 (Local CLT) Under the above assumptions on X n , there exists a C > and a positive 
definite form <, > for which P„(x, y) — Cn~ d / 2 e~ <x ~ y/x ~ y> l 2n {\ + 0(min(n _1 , < x — y,x — y >))). □ 

Now let {Y n : n > 0} be a SRW G started at the point (0, 1). Write Y n = (X n , Z n ), where X n e T 1 
and Z n is the projected RW on S. It can be shown that X n and Z n are exponentially asymptotically 
independent in the sense that the joint distribution of X n and Z n is within e~ cn in total variation of the 
product distribution with the correct marginals. Applying Theorem 8.2 to a time change of X n , it can 
be shown that X n obeys the same local central limit theorem, the correction for the time change being 
smaller than the error bounds in the CLT. This gives 

Theorem 8.3 (Local CLT for G) Let Y n be a SRW G . Then there exists a C > and a positive 
definite form <, > for which 

P n ((x, i), (y,j)) = C n- d / 2 e-< x -y^-y>/ 2n {l + C^min^" 1 , < x - y, x - y >))). 

□ 
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This is sufficient to establish part (i) of Lemma 3.1 along the following lines, as pointed out to us by 
Maury Bramson (personal communication). For x E B n and z € dB m , v Bm (z) is the sum of probabilities 
of paths starting from x and hitting dB m for the first time at z. Reversing the paths, shows that this is 
the expected occupation of x by a SRW starting from z and killed when it hits dB m again. 

First suppose d > 3 and fix e > 0. Then the local CLT for G allows us to pick L > n large enough 
so that for w € dB^, the occupation measures at x and y for SRW W will be within a factor of 1 + e of 
each other for any x,y £ B n . If m is then chosen large enough, the occupation measure for SRW W at 
any point in B n will be at most 1 + e times the occupation measure for SRW W killed upon hitting 3B m . 
Now use the Markov property to write the occupation measure at x for SRW Z killed upon hitting dB m 
as a linear combination over w of the occupation measure at x of SRW W killed upon hitting dB m . This 
shows the measures at x and y to be within a factor of (1 + e) 2 , and since e was arbitrary this establishes 
the Harnack principle (i). 

On the other hand, if d < 2 then SRWq is recurrent, then for any e > and n there is an m large 
enough so that P(SRW X hits y before dB m ) > 1 - e for all x, y e B n . Then u B m > (1 - e)v Bm for all 
x,y G B n , establishing (i). 

The remaining parts of the theorem are derived as follows. To get (ii) from (i), pick x G dB n and 
write 

v% m = P(SRW X does not return to B n )p BmB " + P(SRW X returns to B n )v' 

where v' is a mixture over y € dB n of v Bm , the mixing measure being given by the return hitting 
distribution of SRW X on dB n . Since B n is held fixed, v'({z})/v Bm ({z}) is converging to 1 uniformly in 
zasm-> oo. Since P(SRW X returns to B n ) is bounded away from one (for fixed B n ), solving for p Bm 
gives that sup z p x m /v Bm — *• 1 as m — > oo. To get (if) from (ii), restate (ii) as saying that the sum, call 
it w(x, z,n,m), of £H 7 I over paths 7 from x S B„ to z S B m that avoid 9_B„ and dB m except at the 
endpoints is equal to (1 + o(l))f(x)g(z, m) and functions /, g as m — > 00. The restatement of (iu) is 
easily seen to be identical by time-reversal. To get (in) from (iv), just note that u^ n is a mixture over 
yed(B c m ) of Py n ^ m . 

Finally, to get (v) choose L > n so that i3_L contains all the contracted edges. For (i) and (ii), write 
v% m and p x mBn as a mixture over y g 8Bl of p BmBh and observe that p BmBL for the contracted graph 
is equal to p BmBh for the uncontracted graph, so that all the measures being mixed are identical up to 
a factor of (1 + o(l)). For (Hi) and (iv), write v Bn and p x nBm as a mixture over y € 8Bl of z^ n and 
observe that this time it is the mixing measures, which are just p x LBm that are all within a factor of 
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(1 + o(l)) as x varies with m — > oo. 

8.3 Convergence of probability measures on trees via moments 

Propositions 8.4 - 8.7 are adaptations of classical tightness criteria to the setting of tree- valued random 
variables. The development is brief and essentially copied from [Du]. Theorem 8.8 is less trivial and the 
proof is given in full detail. 

Fix a positive integer r until further notice and restrict attention to trees of height at most r. Recall 
the definitions of \t\, t Ar and N(u;t) from Section 5. We say a family of probability distributions {P n } 
on such trees is tree-tight if for all e > there is a K for which limsup n P„(|T| > K) < e. 

Proposition 8.4 // {P„} is tree-tight then every sequence of measures from {P„} has a subsequence 
that converges in distribution to a probability measure. 

Proof: Since the P„ laws of |T| are tight in the usual sense, every sequence has a subsequence P nj for 
which the laws of |T| converge in distribution to some probability measure. For each k, the conditional 
distribution (P nj | |T| = k) is finitely supported, hence has a subsequence converging to a probability 
measure, and diagonalizing over k gives the desired subsequence. □ 

Proposition 8.5 Let g, h be functions from trees of height at most r to the reals. Suppose that g > 
and that h(t)/g(t) — > as \t\ — > oo. Let P„ be probability measures on these trees with P n — ^Poo 

and 

limsup n E„g < oo, where E„ is expectation with respect to P„. Then E„/i — > E^/i. 

Proof: |E„/i-E 00 /i| < |E„/i/(|T| > X)-E co W(|T| > K)\ + \E n hI(\T\ < K)\ + |Eooft/(|T| < K)\. The 
last two of these can be made small uniformly in n by using \h\/g — > and choosing K large enough. 
The first goes to zero for any fixed K. □ 

Proposition 8.6 Suppose P„ — >Poo and for each tree t, E n N(T;t) converges to a finite limit m(t). 
Then E QO iV(T;t) = m(t) for all t. 
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Proof: For any tree t of height at most r, let C(t) be the set of all t' of height at most r that extend t 
by adding to some vertex in t a single finite chain of descendants. Let 

g t (u)= N ( u ^')- 
t'ec(t) 

Notice that gt{u) > (\u\ — \t\)N(u;t) since each tree-map <p '■ t — > u can be extended to a tree-map of 
some t' G C(t) into u in at least as many ways as there are vertices in u \ Image (0). Then 

limsupE„ 5t (T) < hmsupE„iV(T;t) < ^ m(t) < oo. 

" t'ec(t) " t'ec(t) 

Applying the previous proposition with g = g t and h = N(-;t) finishes the proof. □ 

Proposition 8.7 Suppose E„7V(T; t) — > E^A^T; t) < oo /or all t and that P^ is uniquely determined 
by the values of E^A^T; t). Then Pn^Poo. 

Proof: First notice that {P„} is tree-tight: letting tj be a single chain of j + 1 vertices, |T| = 
Ej=o-^( T ;*j')' so hmsup„E„|T| = limsup„ ^=o E^V(T; ^) < oo by hypothesis, and using P(|T| > 
k) < fc _1 E|T| establishes tightness. Now each sequence in {P„} has a subsequence converging to a 
probability measure G and the previous proposition shows that E n N(T;t) — > EaN(T;t) for each t. 
Then G = P^ by the uniqueness assumption. □ 

For a positive integer- valued random variable X, the moments of X determine its distribution at 
least under a condition on the rate of growth of these moments. The remainder of the development 
of the method of moments for trees is to prove the following analogous fact for for tree-valued random 
variables. Let U be a random variable taking values in the space of locally finite rooted trees of height 
at most r. 

Theorem 8.8 Suppose EN(U;t) is bounded by e fc l*l for some k. Then the law of U is uniquely deter- 
mined by the values of~EN(U ;t) as t varies over finite rooted trees of height at most r. 

The proof is based on the following version of the integer-valued case. Let (A) s = A(A — 1) ■ ■ ■ (A — s + 1) 
denote the s th lower factorial of A. 

Lemma 8.9 Suppose Y e and X are random variables and for some fixed s > suppose the values 
Cj = ~EX(Y) s+ j exist for j > and are bounded above by e fej for some k. Then EXI(Y = s) is uniquely 
determined by the values of the Cj 's. 
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Proof: Let Z = X(Y) S , so that cj = EZ(Y - s) ■ ■ ■ (Y - s - j + 1). Then the values dj d =EZ(Y - s) j 
are determined from the values of Ci,i < j by linear combination. The coefficients arc bounded by some 
exponential e^ , so the dj are all bounded by some exponential e kj . Let 

h(t) = ^(itydj/jl = EZe u{Y - s l 

j>0 

The power series converges for all t, uniformly for t e [0, 2ir] and hence 

/■2-7T 

EZI(Y - s = 0) = (1/2tt) / h(t)dt, 

Jo 

which yields 

/•2-it 

EXI(Y = s) = (1/2tts!) / h(t)dt. 

Jo 

□ 

Proof of Theorem 8.8: The proof is by induction on r. The induction will in fact show that for any 
random variable / and any r, if U has height at most r then the values of EN(U;t)f as t varies over 
trees of height at most r determine the values of EI(U = t)f, provided that EN(U;t)f is bounded by 
e k\t\ f or some fc. us i n g / = 1 will then prove the theorem. The initial step r — is trivial, since then 
U A r is always a single vertex. 

Assume then for induction that the theorem is true for some r. For any tree t let b(t) be the number 
of children of the root of t. Define a random vector Z = [Zy , . . . , ZE V -,) whose length is always b(U) and 
whose distribution conditional upon b{U) is uniform over all b{U)\ permutations of the b(U) subtrees 
below the children of the root of U. In other words Z is this multiset of subtrees presented in uniform 
random order. 

For any i and s with s > i > and any t\, . . . , t s of height at most r, let 

9l = gi (t u ...,t s )= I(ZV =h)- ■■I(Z¥ = U)N(ZY +1 ;t i+1 ) ■ ..N(ZY;t a )I(b(U) = s). 

We set up a second induction on i to show that for any tj , . . . ,t s of height at most r and any random 
variable /, the value of Egi(t\, . . . , t s )f is uniquely determined by the values of EN(U; t)f. 

For the initial step i = 0, choose any t\, . . . ,t s and / and write t* for the tree whose root has s 
children with subtrees t\, . . . ,t s . Write t*j for a copy of t* to which has been added j leaves that are 
children of the root. Observe that a map from t*j into U is given by choosing s ordered distinct subtrees 
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ui, . . . ,u s from children of the root of u, mapping each U into m,, and then choosing an ordered j-tuplc 
of vertices from the remaining b(U) — s children of the root of U. Thus 

N(U;Uj)= N(u 1 ;t 1 )---N(u s ;t s )(b(U)-s) 1 

where the sum is over ordered s-tuples of subtrees from distinct children of the root of U. For each 
(ui, . . . ,u s ), P(Z U ...,Z a ) = (ui, ■■■u s ) = l/(b(U)) s . Consequently, 

EN(Z 1 ;t 1 )---N(Z a ,t a )(b(U)) a+j f 
= EN(U;U d )f. 

As j varies, \t*j | increases linearly with j. By the hypothesis of the theorem this implies that EN(U; t*j)f 
is bounded by e ■? for some k. Then we may apply Lemma 8.9 with Y = b(U) and X = ~EN(Zi; t\) ■ ■ ■ N(Z a , t, 
to get that these expectations uniquely determine Eg — N(Z\;t\) ■ ■ ■ N(Z 8 ; t a )I(b(U) = s)f. 

Now assume for induction that for any ti,...,t 8 , Egif is determined by the values of EN(U ; t)f and 
write 

E. 9m = E/(Z i+ i = t l+1 )[I(Z 1 = ii) • J(Zi = U)N(Z i+2 -t i+2 ) ■ ■■N(Z s -t s )I{b{U) = s)f]. 

We may apply Lemma 8.9 with Y = N{Zi+\\ U+i) and X being the rest of the RHS, provided EX(y) s +j 
is bounded by for some k. But ~EX(Y)j < EN(U;T)f where T is a tree whose root has children 
with subtrees: j copies of t i+2 and one copy of t\,...ti and t i+ i, . . . , t a . Since we have assumed that 
~EN(U;t)f is bounded by some e fc '*', ~EN(U,T)f must be bounded by some e*- 7 , whence the lemma 
applies and Eg i+ i is indeed determined, completing the induction on i. 

Setting i = s and / = 1 now showns that for any s and any t\, . . . t a , P(b(U) = s, Z^ =t\,..., Z^ = 
t a ) is determined by the values of EN(U;t). But this probability is just C(i*)/s! times P(U = U), 
where C(t*) is the number of permutations of {1, . . . s} for which t, = for all i. Thus P(U = t*) 
is determined for an arbitrary of height r + 1, so the induction on r is completed and the theorem is 
proved. □ 
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